Gene Athe_0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0029 
Symbol 
ID7407264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp37443 
End bp38738 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content26% 
IMG OID643714440 
ProductABC transporter ATP-binding protein 
Protein accessionYP_002571965 
Protein GI222528083 
COG category[S] Function unknown 
COG ID[COG4938] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAT CAGTAAAAAA CGTAGGAATT ATTGAAACTG CTAATATAAA GTTAAACGGA 
CTAACCGTGT TAACAGGCGA AAATGACACT GGGAAAACTA CAATTGGTAA AATTTTATTT
ACTATTTATT TTGGCTTTAA TGATTTTATG AAAAACATGG AAAAATATAA AGTACAAATA
TTTAGAGATG ACTTGATAAG AATATTTAAA TTAATAAGGA TGGTTAAAAT TCCTTTTGAA
AGAAAATTTA GAAATCTAAT TCTTGATGAT TTTCTTGCCT TCGGAAAAAA CAATGAATTA
CTTTCATGGC TCATGGAATT AAAAAATGCA GTAGAAAATT CAAATGGCAT TGACAAAAAC
TTAGAAAAAG AAATTGTGAA ATATATCAAT CAAGCAATAG AAAAAATTAA CTTATTAGAA
GATAAAGAAA CTCTTAAAAA ATTTGCTCTT GACAAGATTT TAAATAGAGA ATTTTACGGA
CAGATAAATA ATCTTTTTTT TGAAAAACCT GCACAAATAA TTATTTGTCA AGAAGATGAA
AACGAAGCAT TTATCAGTAT TGTTGAAAAT TCAATATCAG AATGTTTTAT TAATGATTTT
TTATCTTTCA AAGATGCTAC TTTTTTAGAT TTAGCTATAG ATGCTTTTCC TTTAGCTATT
GATATTTTTC CTTTCGATCC TTTTGATTTT AGTACTCATG GAAATAATTT AAAATCGAAA
ATTTTTTTTA AAATACCAAA TGAGAATATC TTAGGCTTAT ATTTGAATAC AAAGAAACAT
AATCTAGAAC CTATTGAGAA CATTTTTAGA AATGTTCTTA CAGGAAACAT TGTCAAGAAA
ATTGATGGTA AAATTTTGGA ATATAATATC AATGGGAAAA ATATAAAAAT AGAAAACTTG
GCATCTGGTT TAAAAGTATT CATAGTTCTA AGATTGTTAT ATGAAAATGG CTATATATCA
AGAGAATCTC TATTAATAAT CGATGAACCT GAAACTCATC TGCATCCTAA ATGGCAGTTA
GACTGCGCAG AACTTTTAAC ATTATTCGTC AAAGAATTAG AAGCTAACAT TTTGTTAATT
TCCCATAGCC CATATTTTAT TGAAGCTATT GAAGTTTTTA GTGAACATTA TAAAATTGAA
CATAAGACAA ATTTTTATTT AGCAACAAAG AAAGAACCGC ACTTTGTAGT TTTTGAAAAT
GTAAATCAAC AATTAGAAAA AGTTTATGAA CTTCTTTCAT TCCCTTTTGA TAAATTAGAA
GAAATAAGAG AGAGGGATAT GATGAATGGA AATTGA
 
Protein sequence
MELSVKNVGI IETANIKLNG LTVLTGENDT GKTTIGKILF TIYFGFNDFM KNMEKYKVQI 
FRDDLIRIFK LIRMVKIPFE RKFRNLILDD FLAFGKNNEL LSWLMELKNA VENSNGIDKN
LEKEIVKYIN QAIEKINLLE DKETLKKFAL DKILNREFYG QINNLFFEKP AQIIICQEDE
NEAFISIVEN SISECFINDF LSFKDATFLD LAIDAFPLAI DIFPFDPFDF STHGNNLKSK
IFFKIPNENI LGLYLNTKKH NLEPIENIFR NVLTGNIVKK IDGKILEYNI NGKNIKIENL
ASGLKVFIVL RLLYENGYIS RESLLIIDEP ETHLHPKWQL DCAELLTLFV KELEANILLI
SHSPYFIEAI EVFSEHYKIE HKTNFYLATK KEPHFVVFEN VNQQLEKVYE LLSFPFDKLE
EIRERDMMNG N