Gene Athe_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0107 
Symbol 
ID7408469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp130693 
End bp131814 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content39% 
IMG OID643714515 
ProductMonosaccharide-transporting ATPase 
Protein accessionYP_002572038 
Protein GI222528156 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTGA AAAAAAACTT GCGAACTTAC ACTCTCATAA TTGCAATTCT CCTTATATGG 
ACAATATTTA CAGTACTTAC TGATGGGAAT TTTCTAACAC CAAGAAATCT TTCAATGCTT
GCAAGACAGA TGGCAATCAC AGCTCTTGTT GCAATTGGTA TGGTATTTGT AATTGTTGCA
GGGCACATTG ACCTTTCGGT TGGTTCTGTT GTCGGATTTA CTGGTGCTAT TGCTGGTGTT
TTGCAGGTTT GGAATGGGTG GTCAACTCCT GCTACAGTTA TTGCAGTTTT GATAGTTGGT
ATTATAATTG GTATATGGCA AGGATACTGG GTTGCATACA GGGGCGTTCC AGCATTCATT
GTTACGCTGG CAGGAATGCT TGTGTTCAGA GGCGGTGTGC TTTTAGCAAG CAAGGGTATC
ACAATATCAC CTTTTAAAGA TAGTTTTAGA TTTATCGGGC AAGGATATTT GAATAAAGCT
TTGAGCATTG CATTTGGGGC TGTTTTAATT GTTGGGTATC TTCTTTTAAC AATTAGCCAG
AGAAATAGAA GGAAAAAATA CAACTTAGAA GTTTTGCCAA TGGGCTTGGA GATTGCAAAA
GCTGCAGTTG TAATTGCTCT TATTGTTGCA TTTACGGGCG TTATGATAAG CTATGAGGGA
ATTTCTATTC CTGTTCTGAT ACTTGTTGTG TTTACAATCC TGCTAACATT TGTTTCTCAG
AACACAACAT TTGGAAAATA TGTGTATGCA ATAGGTGGAA ACAAAGAAGC AGCAAGTCTT
TCGGGTATAA ACATTAGAAA TGTGACAATG AAGATTTTCA TTCTCATGGG ATTTTTATCG
GCACTGGCAG GAATTGTATT AACATCAAGA CTTGACGCTG CAACATCTGG TGCTGGAACA
AATATGGAGC TTGATGCAAT TGCTGCTGCA ATCCTCGGTG GAACAAGCAC ACTTGGCGGT
GAAGGAACAG TTCCGGGTGC TATCATAGGT GCTTTAATTA TGGCAAGCAT AGACAACGGT
ATGAGCCTTT TGAACTTGGA ATATTCATAT CAGCTGATTG TAAAAGGACT TGTTCTTGTA
TTTGCAGTTT GGCTTGATAT TATGTCAAGA AAGAAAGCAT AA
 
Protein sequence
MNLKKNLRTY TLIIAILLIW TIFTVLTDGN FLTPRNLSML ARQMAITALV AIGMVFVIVA 
GHIDLSVGSV VGFTGAIAGV LQVWNGWSTP ATVIAVLIVG IIIGIWQGYW VAYRGVPAFI
VTLAGMLVFR GGVLLASKGI TISPFKDSFR FIGQGYLNKA LSIAFGAVLI VGYLLLTISQ
RNRRKKYNLE VLPMGLEIAK AAVVIALIVA FTGVMISYEG ISIPVLILVV FTILLTFVSQ
NTTFGKYVYA IGGNKEAASL SGINIRNVTM KIFILMGFLS ALAGIVLTSR LDAATSGAGT
NMELDAIAAA ILGGTSTLGG EGTVPGAIIG ALIMASIDNG MSLLNLEYSY QLIVKGLVLV
FAVWLDIMSR KKA