Gene Athe_2266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2266 
Symbol 
ID7407685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2402410 
End bp2403909 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content39% 
IMG OID643716632 
ProductABC transporter related 
Protein accessionYP_002574111 
Protein GI222530229 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000122283 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGCT CCACTGATAG CGAACTTTTA AGGCTGCATG GAATTACAAA GATTTTTCCA 
GGGACAGTTG CACTTTCTGA TGTTTCGTTT TCTGTAAACA AAGCTGAAAT TCATGCGATA
GTTGGTGAAA ACGGTGCTGG AAAATCAACT TTGATGAATA TCATATCAGG AAGTCTTCTT
CCAGACAAAG GTGAGATATA CCTTGAAAGC AAGAAAGTCA ACATAAGGTC ACCAAGGGAT
GCACAAAATC TTGGCATTAG CATAGTTCAC CAGGAGCTTG CGCTCTGTCC GCATCTTACT
GTTGCTGAGA ATATATATAT AGGGAGGCTT CCAGAAAAGT CAGCAAAGAT TGTGGATTAT
AAGACGCTCA ACAAGATGTC GCAAGAGGTC TTGTCTTTGT TTGATGAGGT GAACATAAAG
CCAACTGACA AGGTTGCAAA CTTGAACGTT GCGCAACAGC AGATTGTTGA GATTGCAAAG
GCAATTACAT TTAACTGCAA ACTTTTGATT TTGGATGAGC CAACATCGGC TTTATCTGAA
GCTGATGCAG CAGTGCTCTT TAAAATCATA AAAGATTTAA AGGCAAAAGG AATAAGCATT
CTTTACATTT CTCACAGGCT ACGTGAGATT TTTGAGCTTG CAGACAGAAT CACTGTCCTT
CGCGATGGCA GATACATCAA AACGCTGAAT ACGTCTGAGA CAAACCCTGA CCAGGTTGTA
AGCCTGATGG TGGGAAGAGA AATCAAAGAG ATGTACCCGC CAAAGAGCAC ATATGTTGGC
AAAGAGATCT TCAGGGTGGA AAACATTAGC TCTGACAAGG TTTACAATGT TTCATTTTCG
CTCAGAGAAG GTGAAATTTT AGGTTTTGCA GGGCTTGTAG GAAGCGGAAG AACTGAGCTT
GCCCAGACAA TCTGCGGAAT TTTGCCAAAG CATTGTGGGG AAATTTATCT TGAAGGAAAA
AGGATTGAGA TAAGCTCGTT TGAAGATGCT ATCTGGCATA AAATCGGTTA TGTTACAGAA
GACAGAAAAC AGTTTGGTCT TTTTCTAAAA CTTCCTGTTG CGCACAATGT CTCGGCAATA
CATCTCAAAT ATGATTACAA AAAGTTTTTG ATTGACAAAA ATCATGAACT TTCGCTTGCA
GATAAGTATG TTAAAAAACT AAATGTAAAA ACTTCATCAT ATGTACAGCT TGTTATGAGT
CTTTCAGGTG GGAATCAGCA AAAGGTAATG ATAGCAAAAT GGCTTGCTAT AAATCCAAAG
ATTTTGATTT TGGATGAGCC TACACGTGGA ATAGATGTTG GTGCAAAGGC AGAAATTCAT
AGTCTTTTAA GAGAGCTTGC AAAAAACGGA ATAGGGATTA TTCTAATCTC ATCTGAGCTT
CCCGAGATAA TTGGAATGTG TGACAGGGTA CTTGTCATGA GAGAAGGAAG GATAACTGGA
GAACTCTCTG GAGAAAAAAT CACAGAAGAG AACATCATGC AGCTTGCTGC ACACAAGTAA
 
Protein sequence
MKSSTDSELL RLHGITKIFP GTVALSDVSF SVNKAEIHAI VGENGAGKST LMNIISGSLL 
PDKGEIYLES KKVNIRSPRD AQNLGISIVH QELALCPHLT VAENIYIGRL PEKSAKIVDY
KTLNKMSQEV LSLFDEVNIK PTDKVANLNV AQQQIVEIAK AITFNCKLLI LDEPTSALSE
ADAAVLFKII KDLKAKGISI LYISHRLREI FELADRITVL RDGRYIKTLN TSETNPDQVV
SLMVGREIKE MYPPKSTYVG KEIFRVENIS SDKVYNVSFS LREGEILGFA GLVGSGRTEL
AQTICGILPK HCGEIYLEGK RIEISSFEDA IWHKIGYVTE DRKQFGLFLK LPVAHNVSAI
HLKYDYKKFL IDKNHELSLA DKYVKKLNVK TSSYVQLVMS LSGGNQQKVM IAKWLAINPK
ILILDEPTRG IDVGAKAEIH SLLRELAKNG IGIILISSEL PEIIGMCDRV LVMREGRITG
ELSGEKITEE NIMQLAAHK