Gene Athe_2268 details


Gene Information

Locus tag: Athe_2268
Symbol:
ID: 7407687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2404986
End bp: 2406473
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 37%
IMG OID: 643716634
Product: ABC transporter related
Protein accession: YP_002574113
Protein GI: 222530231
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
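
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated on the page. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2404986, 2406473)
print(length)                  # 1488
print(protein_length(length))  # 495
```

Under these assumptions the computed values agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields.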


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAATTG TCTTAGAAAA CGTATCAAAG AAGTTTGAAG AATTTTATGC CTTAAAGAGA 
ATTAACCTTG AATTTTTTGA GGGCGAAGTG CATGTAATAT TAGGTAAAAA CGGTTCTGGC
AAGACTACTC TAACAAAGGT TATCAACGGA ACATACCAGC CATCTGAAGG AAAAGTTATA
ATAAACGGCA AGGCTTTTTC AAGTCTAAAT CCGCTTTTGG CAAAGTCGCT TGGGATTTTC
ACTGTGCATC AGGAACTTTT GTACTTTCCG CACCTTTCTG TTGCAGAAAA TATCTTCATT
GAAAACAAGC CGAAAAATAG CCTTGGATTT ATTAGCACTA AAAAGATGAT AAAGCTCTCT
TCTGAAATCC TTGCTAAAAT GGGTGTTAAC ATAAATCCAG AGGCAAGTTG CAAAAACCTT
GGCACATCTC AGCTTCAGGT GATAGAGATT TGCCGTGCAC TTGTGCAAGA TGCAAAGGCT
ATTATATTCG ATGAGCCAAC AGCCGGTCTT ACTTCGTATG AGATAGAAAA CCTTTTTAGA
ATTATCAAGG AGCTAAAGCA AAAAGGAATC ATCATAATAT TTGTCACAAA TATGGTAGAG
GAAGCACTCT CCATTGCTGA TAGAATAACA ATCCTGCGCG ACGGCGAGGT TGTGGCATCT
GATAAAGTAA CAAGCTTTAA TCTGAACAGG GCAATTTCCC TCATCTTTGG AAGCATAAAC
AGAACATATC CAAAGCTAAA AGTAGAAAAG GGGCCGGTCA TTTTTAGCGT CAAAAATTTA
ACAAAGTCTG GGATAATTGA GGACATATCA TTTGATGTAA GAAGCGGCGA GGTGTATGGA
ATAGCAGGGC TTGTTGGCTC TGGTCGAAGC TTTTTGGCAA AAGCACTTTT TGGTGCTGAA
AAAGTAGACT CGGGAGAGAT TTTTATTAAA GGAAATTTTT TGAAGCTTTC AAGCCCATCT
GATGCAATAA AAAATGGCAT TGCATTTATG AGCGAAGATA GGATTGGCAC AGGACTTTTT
AAGACTTTGA ACATTGCAGA TAACATCATC TCATCTAATA TTTGGAATAT TGTAAATGGG
TTTTTCATTG ACTCATCACG TCAAGAGAAG ATTGCAAATT TTTTTATAAA GCGGCTTGGG
ATAAAGCCAA AGGAAATCTC ACAAAAGATA GCAACTCTAT CTGGAGGTAA TCAGCAAAAG
GTTTTGATCG CCAAGTGGTT ATTTTCCCAG TCTTTAGTAT TTATTCTTGA TGAGCCAACA
AAGGGTTTGG ATTTGGCATC TAAGGTTGAG GTGTACAACA TCATAAACGA GCTTGCGCGA
ATTGGCTGTG CAATTATTTT CATCTCCTCT GAATTTTCAG AGCTAATTGG CATGTGCGAT
AGGATTTTGG TTTTGAGAGA AGGCAAAAAA GTTCATGAGT TTTCTAAGAA GGAAATGGAT
TATGATAGAA TTTTAAAAAG TGCGATGGGA ATTTTAGAAA GCAAATAA
 
Protein sequence
MSIVLENVSK KFEEFYALKR INLEFFEGEV HVILGKNGSG KTTLTKVING TYQPSEGKVI 
INGKAFSSLN PLLAKSLGIF TVHQELLYFP HLSVAENIFI ENKPKNSLGF ISTKKMIKLS
SEILAKMGVN INPEASCKNL GTSQLQVIEI CRALVQDAKA IIFDEPTAGL TSYEIENLFR
IIKELKQKGI IIIFVTNMVE EALSIADRIT ILRDGEVVAS DKVTSFNLNR AISLIFGSIN
RTYPKLKVEK GPVIFSVKNL TKSGIIEDIS FDVRSGEVYG IAGLVGSGRS FLAKALFGAE
KVDSGEIFIK GNFLKLSSPS DAIKNGIAFM SEDRIGTGLF KTLNIADNII SSNIWNIVNG
FFIDSSRQEK IANFFIKRLG IKPKEISQKI ATLSGGNQQK VLIAKWLFSQ SLVFILDEPT
KGLDLASKVE VYNIINELAR IGCAIIFISS EFSELIGMCD RILVLREGKK VHEFSKKEMD
YDRILKSAMG ILESK
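
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be normalized before any analysis. The sketch below is one possible way to do that in Python: it strips the whitespace, computes a GC fraction, and checks for a terminal stop codon. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop_codon`) are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only the first and last lines of the gene sequence are used here as sample input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop_codon(dna: str) -> bool:
    """True if the last three bases form a stop codon."""
    return dna[-3:] in STOP_CODONS


# First and last lines of the gene sequence listed above.
first_line = "ATGTCAATTG TCTTAGAAAA CGTATCAAAG AAGTTTGAAG AATTTTATGC CTTAAAGAGA"
last_line = "TATGATAGAA TTTTAAAAAG TGCGATGGGA ATTTTAGAAA GCAAATAA"

print(len(clean_sequence(first_line)))                  # 60
print(ends_with_stop_codon(clean_sequence(last_line)))  # True
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field, although the exact rounding used by the source database is not known.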