Gene Arth_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1126 
Symbol 
ID4446394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1221811 
End bp1223109 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content63% 
IMG OID639688932 
Productextracellular solute-binding protein 
Protein accessionYP_830620 
Protein GI116669687 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.190587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGTA TAACTACCCG AACCCTCCGC AAAACGATCG CCGTAGTTTC TGCGCTGAGC 
CTGCTCGGCC TCACAGCGTG TTCCAGTCCG TCGACCAACC AGCCCGAGGA CGGCAACGTT
GAAATCCGCT TCTCCTGGTG GGGCAACGCG GGCCGCGCCG AGCTGACCAA CAAGGCCATC
AAGGAGTTCG AAGCCGCCAA CCCCACCATC AAGGTCAAAC CTGAGTATGG GGACATTAGC
GGCTACTTCG ACAAACTGGC CACGCAGATG GCCGCCAACG ATGCACCCGA CGTGATCACC
ATGGGCGGCG CCTATCCGGC CGAGTACGCC AACCGCGGCG CACTGCTGGA CCTGTCCAAG
GTCAGCGGCT CCCTGGACCT CTCCAAGACG GACCAGGGAG CCCTGGAGAA CGGCCAGGTG
CAAGGCAAGC AGTACGGGGT CTCCACCGGA GCCAACGCGT TGGCGATCGT CGTGAACCCC
GGCGTCCTGC AGGCCGCCGG CGTTGCGCTT CCGGATGACA GCACGTGGAC CTGGGATGAT
TTCGCCAGGC TGGCTGCCGA CGTGACAGCC AAGAGCCCTA AGGGCACCTA TGGGACGGCA
ACCGTCCTCA CGCACGATTC GCTGGACGCC TTTGCCCGCC AACGCGGCGA GTCGCTCTAC
ACCCAGGACG GGCAGCTCGG ACTGGGCAAG GAGACGGTGC AGGACTACTT CGACTACTCG
GTGAAGCTGA GTGAATCGGG AGCTGCGCCC GGTGCCTCGG AGACCGTGGA AAAGCTAAAC
GTCAGCACCG AGCAGACATT GATGGGCATG GGCAAGGCCG GGATGATGCT CACGTGGAGC
AACTCTCTCG CCGCACTCAG CAAGGCCTCG GGGGCGGACC TGAAGCTTCT TAGGCTCCCC
GGCGAAACCC CGACCCCGGG CATCTGGCTG CAGTCTTCGC AGTTCTACAC GATCTCTGCC
CGCAGCAAGC ACACCGATGC CGCCGCGAAG CTGGTGAACT TCCTGGTGAA CGACGCATCC
GCCGCAAAGA TTATTCAGAG CGACCGCGGC GTGCCCAGTA ACTCCGGAAT GCGCACTGCG
ATCCAGGACA TGCTGACGCC GCAGGGCAAG ATCGAGGCAG CTTACATCGA CCAGATAGGC
AAGATGGACT TTGCGCCTAC CTTCATTGGC CCCACCGGTT CAACAGCCGT CTCCGAGATC
ACGGCCCGCA TCAACACCGA TGTCCTGTTC AAGCGGCTCT CGCCGGAAAA AGCCGCGGAG
CAGTGGATCA GCGAAAGCAA GGCGGCCATC GGCAAGTAG
 
Protein sequence
MARITTRTLR KTIAVVSALS LLGLTACSSP STNQPEDGNV EIRFSWWGNA GRAELTNKAI 
KEFEAANPTI KVKPEYGDIS GYFDKLATQM AANDAPDVIT MGGAYPAEYA NRGALLDLSK
VSGSLDLSKT DQGALENGQV QGKQYGVSTG ANALAIVVNP GVLQAAGVAL PDDSTWTWDD
FARLAADVTA KSPKGTYGTA TVLTHDSLDA FARQRGESLY TQDGQLGLGK ETVQDYFDYS
VKLSESGAAP GASETVEKLN VSTEQTLMGM GKAGMMLTWS NSLAALSKAS GADLKLLRLP
GETPTPGIWL QSSQFYTISA RSKHTDAAAK LVNFLVNDAS AAKIIQSDRG VPSNSGMRTA
IQDMLTPQGK IEAAYIDQIG KMDFAPTFIG PTGSTAVSEI TARINTDVLF KRLSPEKAAE
QWISESKAAI GK