Gene Athe_0617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0617 
Symbol 
ID7406958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp700500 
End bp701699 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content38% 
IMG OID643714998 
ProductROK family protein 
Protein accessionYP_002572514 
Protein GI222528632 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAACC ACACGCTACT AAAGCAAATA AACAAGCTTT TGGTTTTGAA AACAATTTTG 
GACAACAAGA TAATATCTCG TACAAAGATA TCGAAACTTG TAGATTTAAA TAAAGCAACA
GTCTCAAACC TCACCGATGA GCTCATAAAA GAAGGGTATG TAGTAGAAAA AGGATACGGT
AAGTCCAAAG GCGGAAGAAG ACCTGTCCTT TTACAGGTAA ACAAAGATGT AGGTTCAATC
ATCGGAATTG ACTTAGGTGT TGACTATATT CATATTATTC TCTCAAACTT TGTGGGAGAA
GTTATTTTTG AAGAATATGC CAACATGAAA ATAGGTGAGG ATAAGGAAAA ACTTTTAAGG
CTTCTTTTTG ACCTGATTGA AAAATCAGTA AAAAAGGCGC CACAAACTCC AAAAGGGATT
TTAGGTATTG GAATTGGTGT TCCAGGTATT ATAGAAAAAG AGTCTGGAAC TGTCCTTCTT
GCTCCCAATT TGAAATGGCA AAATGTCCCT TTGAGGTCAA TTGTTCAGCA AAAGTTCAAC
CTCCCTGTTT ATATTGACAA TGAAGCAAAT GCAGGCGCAC TGGGCGAAAA GTGGTTTGGT
GAGTGGGGAA AAGTTAGTGA TTTGATTTAT TTGAGTGTTG GAATTGGGCT TGGTGCAGGA
ATTATTATCG ACAACAAACT TTTCAGAGGT GCTGCAGGAT TTGCAGGTGA AGTTGGACAT
ACCACTATCA ACTTTCAGGA CGATGTTTGC AGCTGCGGCA ATATTGGCTG TCTTGAGAAC
TTTGCATCCG AGAGGGCACT TTTGAGTGTT ATAAAAAAGC TTGTAAAACA AGGAGTAGAG
GATAGGTATA TAAGCTGGGA AAATGTAGAT GAAATCACTC CTTCTCGGAT AATACAAGCA
GCAAAAGAAG GAAGCAGAGT TTGCAGAATG GCTATACTTG AGGTTGCTGA AAAGATGGGG
ATAGGCGTGG CAAATCTTGT AAATATTTTT AATCCTGAAA TGGTAATTAT AGGTAATAAG
GCATCATTTT TTGGAGAGTT GTTTTTAGAA AAATTGAGGG AAGTAATTAA CCAGAGATCC
TTTATCGCCC AGTTTTATAA TCTTAAGATA GAGGTTTCAA AACTGAAAGA CAGAGCTGTG
GTGCTGGGAT GTATAGCTAT GGTGATTTCC GACATGCTTT CTTTTCCAGA ATATGCATGA
 
Protein sequence
MGNHTLLKQI NKLLVLKTIL DNKIISRTKI SKLVDLNKAT VSNLTDELIK EGYVVEKGYG 
KSKGGRRPVL LQVNKDVGSI IGIDLGVDYI HIILSNFVGE VIFEEYANMK IGEDKEKLLR
LLFDLIEKSV KKAPQTPKGI LGIGIGVPGI IEKESGTVLL APNLKWQNVP LRSIVQQKFN
LPVYIDNEAN AGALGEKWFG EWGKVSDLIY LSVGIGLGAG IIIDNKLFRG AAGFAGEVGH
TTINFQDDVC SCGNIGCLEN FASERALLSV IKKLVKQGVE DRYISWENVD EITPSRIIQA
AKEGSRVCRM AILEVAEKMG IGVANLVNIF NPEMVIIGNK ASFFGELFLE KLREVINQRS
FIAQFYNLKI EVSKLKDRAV VLGCIAMVIS DMLSFPEYA