Gene Athe_2553 details

Gene Information

Locus tag: Athe_2553
Symbol:
ID: 7409504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2671827
End bp: 2672753
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 37%
IMG OID: 643716917
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_002574394
Protein GI: 222530512
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4209] ABC-type polysaccharide transport system, permease component
TIGRFAM ID:
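The length fields above follow from the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2671827, 2672753)
print(length)                     # 927
print(protein_length_aa(length))  # 308
```

Under these assumptions, the computed values agree with the Gene Length (927 bp) and Protein Length (308 aa) fields.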


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.336984
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTCAT CTAAGAAAGG AATCTTTTCA ACAATCTACA ACCAGCGCCA GCTAATTGTT 
CTTACATTTC CTTTTTTAAT TATGGTGCTA ATTTTTAACT ACTTTCCACT GTGGGGCTGG
CTTTTGGCAT TCAAAGACTA CAAACCGTAT CTTGGTTTTC AAAATTCAGA GTGGGTTGGA
TTTAAAAACT TTGTGGACCT GTTTTCAGAC GTTTATTTTT TCCAGGCGCT TAGAAATACC
TTGGTCATAA GTTTTTTGAA ACTAATATTC AATTTTTTGT CATCCATTAC ATTTGCAATA
CTTTTGAATG AGATTAAAAA CATGCTGTTT AAAAGGACAG TGCAGACAAT CTCCTACCTT
CCACACTTTG TTTCATGGGT TGTAGCAGCA AATATTGTAT ACACAGTGCT TTCACCTGAC
TATGGGATAA TAAACGAACT TCTTGTCAAA TTCCATATTC TAAAAGAACC AATTAACTTT
TTGGGCGAAC CTAAATATTT CTGGTTAATA GCACCAATTA CTGAGGTCTG GAAAGAGATG
GGTTGGAATG CAATTATATA CTTGGCTGCA ATGACTAACA TTGACCCGCA GCTTTATGAA
GCTGCAAGCA TAGACGGTGC AGGCAGGCTA AAGAGAATAT GGTATATAAC CTTGCCAGGT
ATTCTACCCA CAGTCAAGAT TCTTCTTATC ATGAACGTTG GATGGATTTT GAACGCTGGG
TTTGAACAGA TGTACCTTTT GCAAAGACCA TCAACCTTGG ACTATTCAGA TATCCTTGAG
ACATACATTT TGAGGTATGG TATTGGCAGT GGAAGATGGT CATATGCAAC AGCAGCTGGT
ATTTTCAACT CTGTTGTAAG CCTTATTCTT GTTACAACTG CAAACAGAAT AGCATCTAAG
ATTGGTGAAG GCGAAAGGGT ATTTTAA
 
Protein sequence
MQSSKKGIFS TIYNQRQLIV LTFPFLIMVL IFNYFPLWGW LLAFKDYKPY LGFQNSEWVG 
FKNFVDLFSD VYFFQALRNT LVISFLKLIF NFLSSITFAI LLNEIKNMLF KRTVQTISYL
PHFVSWVVAA NIVYTVLSPD YGIINELLVK FHILKEPINF LGEPKYFWLI APITEVWKEM
GWNAIIYLAA MTNIDPQLYE AASIDGAGRL KRIWYITLPG ILPTVKILLI MNVGWILNAG
FEQMYLLQRP STLDYSDILE TYILRYGIGS GRWSYATAAG IFNSVVSLIL VTTANRIASK
IGEGERVF
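
The sequences above are printed in blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGCAGTCAT CTAAGAAAGG AATCTTTTCA ACAATCTACA ACCAGCGCCA GCTAATTGTT"
last_row = "ATTGGTGAAG GCGAAAGGGT ATTTTAA"
print(translate(first_row))  # MQSSKKGIFSTIYNQRQLIV
print(translate(last_row))   # IGEGERVF
```

The two printed fragments correspond to the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.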