Gene Athe_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2080 
Symbol 
ID7408789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2200890 
End bp2201816 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content35% 
IMG OID643716447 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002573930 
Protein GI222530048 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.136718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAATA TTAATCGAAA AAAGTCTTTC AGAGAACTTT TGATGAGTTC AGAAAATGTT 
GCTGGTTATG TTTTTATATC TCCCTGGCTT ATTGGCTTTT TTGTGTTCAC TTTGATTCCT
ATTGCAGCAA CCTTTTATTT GTCTTTTACT CAATATGATT TATTATCATC TCCTAAATTT
GTAGGATTAC AAAATTATGT ACAAATGTTT AAAGAAGATC CGTTATTTTG GAAATCAATG
TCAGTAACTT TTTTCTATGT GTTTGTAACT GTGCCATTAA AGCTGGCTTT TGCATTGCTT
CTTGCCCTTT GGCTTTCTTA CAAAAGCAGA CTAACACCAT TTTACAGGGC TGTATACTAT
GTTCCTTCTA TGATGGGTGG CAGTGTGGCT GTGGCAGTGC TTTGGCAAAG ACTTTTTACA
AGTGATGGTG TTATAAATTC AATATTGAAA CTATTTGGAA TTCAAAGTGA GACTTCATGG
ATAGGAAATC CAAGAACTGC TATATGGACG TTGATATTAC TTGCGGTTTG GCAATTTGGT
TCGCCGATGT TGATATTTTT AGCAGGTTTA AAGCAAATAC CAGAAAGCTA TTATGAAGCA
GCTATTATTG ATGGAGCAAA TAGCTGGCAA AAGTTTGTTA AAATAACCTT GCCGATGCTC
ACACCAATAA TATTTTTCAA CTTGATTATG CAGATGATAG GAAGCTTTAT GACTTTTACT
CAAGGATTCA TTATTACAAA TGGCGGCCCT GTGAACAGCA CACTCTTTTA CGCTATTTAC
CTCTACAGAA GAGCATTCCA ATTTTATGAC ATGGGCTACA GCTGTGCTAT GTCGTGGGTA
ATGCTTATTA TCATTGGAAT ACTCACAGCT TTTATATTCA AATCATCTAC ATTTTGGGTA
TATTATGAGT CCAAGGAAGG TGAATAA
 
Protein sequence
MSNINRKKSF RELLMSSENV AGYVFISPWL IGFFVFTLIP IAATFYLSFT QYDLLSSPKF 
VGLQNYVQMF KEDPLFWKSM SVTFFYVFVT VPLKLAFALL LALWLSYKSR LTPFYRAVYY
VPSMMGGSVA VAVLWQRLFT SDGVINSILK LFGIQSETSW IGNPRTAIWT LILLAVWQFG
SPMLIFLAGL KQIPESYYEA AIIDGANSWQ KFVKITLPML TPIIFFNLIM QMIGSFMTFT
QGFIITNGGP VNSTLFYAIY LYRRAFQFYD MGYSCAMSWV MLIIIGILTA FIFKSSTFWV
YYESKEGE