Gene Arth_3167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3167 
Symbol 
ID4444227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3557263 
End bp3559176 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content64% 
IMG OID639690993 
ProductS-layer domain-containing protein 
Protein accessionYP_832645 
Protein GI116671712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTAA TAAAACGCCC CGCCCGCACC CGCTACCTCG TTGTGGCAGC CCTCGTGGCC 
GGGGCGTCAA CTCTGGCCGG TCCGGCAGTT CCGGCGCAGG CGTCAGAAGC CTGCGTCCTC
GTTCCCAACG GGGCTTACAC ATGTTTCGAC CGCGGCGACG AAGGTGCCGG CGAATGTCCC
GGTGGCGCGT GGGTCGGCCC GGACAACGTG GTGCACTACG GTCCTCAACC GGAGCGGTTC
CAGAACTGGC TGTTCGCCCC GTCACCGCTG ATTGCGGGCC GTCCCGCCGT CGGAAGGACG
CTTACGGTCC AGCCCTGCAA CTGGGTTCCG GACCCCGTCA CGCTGCACGT CCAGTGGAAG
CGGAACGGGC GTGCCATTCC CGGAGCGACC GGACAGTCCT ACACGCTGAC CGAGTCGGAC
CAGGGCAAAG CCATAACGGT GACCGTGACC GGATCGAAGG CAGCCTTCGC CAGTCTGGCG
CGGACCAGCG CACCCACGGC CGTTATTGCT CTCGCTGAGT CATACGTCGA TCCGCCGGCG
CCGGAAATCG TCGTCGGAAA CTCCGTCACG AGCGTCGGCA CGCGGCTGTA CGCGACCAAT
ACTTGGCGGT ACACCGAGCT GTCATATTCC CACCAGTGGA AGCGCGACGG TGCAGCTGTG
GACGGAGCCA CTGGCGAAGA GTACGTCCTG ACGACGGCGG ACATCGGCCA CAAAATGACG
GTGACCCGGA CGGGCTGGCG CCCTGGATGG GCCACGGTGA CAAAGTCCAG CGCCCAGTCC
GACGTCGTGG TGGCGGGTCC CCCGCCGTCG TTCACCCTGC CGCCGGACAC CACGGCACCG
GTGGTGGCTC CCTCCCTTGC CCAGGGAACG TACCTCTACG GCCAGGCACT TTCCCTCGCG
GCAGATGAGA CCGCTGACAT CTTCTACACA ACAGACGGTT CGACGCCGAC GCAAAGCAGC
GGCCGTTACA GCGGCCCCTT CTCCCTGGAC AGGACCACCG AGCTTCGTTT CATCGGCATT
GATCTGGCGG GAAATGTCTC GGCGGCCGCC ACCCAGGTTT ACTCGATCAA GCCCGAGCCG
GTCGTGGACT CCGCCGCCCC CTCAATGTCC ATGTCCCTCC AGAACGGGAC CTACCTGACC
GGGCAGCGGC TGGAGATTTC GACTGACGAG GTGGCAGCCG TCTACTACAC CACTGATGGA
AGCACTCCGA CAACCGCCTC CTCCGAGTAT TCGTCGCCGC TCAAATTGGA AGCAAGCGCG
GCCTACCGGT TCATTGCCGT TGACACCAGC GGCAACTCGT CCGTTCCGGT GAGCCGAAGC
TTCGTAGTAC AAACGCCGGT GTTTGACGAC ATCGGGCCGG GAACCCAGTT CTTCGCCGAG
ATCAGCTGGC TCGCCAAGGA AGGCATTTCA ACGGGCTGGG ACGAACGGGG CTCCCGCACA
TACCGGCCCG TGCAGCCCGT GAACCGGGAC GCGATGGCGG CGTTCATGTA CCGGCTGGCC
GGGTACCCGC CGTTTATCCC TCCGGCCGCA TCCCCGTTTA CTGACATCGC ACCCGGGAAC
CAGTTCTACA AGGAAATTAC CTGGCTGGCC TCAACGGGAA TCTCCACCGG GTGGGACGAA
GGCAACGGAC GCCGGAGCTA CCGTCCCCTG CAGCCCGTCA ACCGGGATGC GATGGCAGCC
TTCATGTACA GGTTCGCCGG CAGCCCCGAT TTTGGCGCGT CACCCCTTTC GCAATTCACG
GATGTCGCCG GCTCACCGTT CTACAAGGAG ATCAGCTGGT TCGCTGACAA AGGCATATCC
ACGGGATACA CGGAACCGAA CTGGACGCGG ACCTACCGCC CGCTGAAGCC GGTCAACCGC
GACGCCATGG CTGCTTTCAT GTACCGCCTC CACAGCGCCT TCGGCACTAA GTAG
 
Protein sequence
MALIKRPART RYLVVAALVA GASTLAGPAV PAQASEACVL VPNGAYTCFD RGDEGAGECP 
GGAWVGPDNV VHYGPQPERF QNWLFAPSPL IAGRPAVGRT LTVQPCNWVP DPVTLHVQWK
RNGRAIPGAT GQSYTLTESD QGKAITVTVT GSKAAFASLA RTSAPTAVIA LAESYVDPPA
PEIVVGNSVT SVGTRLYATN TWRYTELSYS HQWKRDGAAV DGATGEEYVL TTADIGHKMT
VTRTGWRPGW ATVTKSSAQS DVVVAGPPPS FTLPPDTTAP VVAPSLAQGT YLYGQALSLA
ADETADIFYT TDGSTPTQSS GRYSGPFSLD RTTELRFIGI DLAGNVSAAA TQVYSIKPEP
VVDSAAPSMS MSLQNGTYLT GQRLEISTDE VAAVYYTTDG STPTTASSEY SSPLKLEASA
AYRFIAVDTS GNSSVPVSRS FVVQTPVFDD IGPGTQFFAE ISWLAKEGIS TGWDERGSRT
YRPVQPVNRD AMAAFMYRLA GYPPFIPPAA SPFTDIAPGN QFYKEITWLA STGISTGWDE
GNGRRSYRPL QPVNRDAMAA FMYRFAGSPD FGASPLSQFT DVAGSPFYKE ISWFADKGIS
TGYTEPNWTR TYRPLKPVNR DAMAAFMYRL HSAFGTK