Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0617 |
Symbol | |
ID | 7406958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 700500 |
End bp | 701699 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714998 |
Product | ROK family protein |
Protein accession | YP_002572514 |
Protein GI | 222528632 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAACC ACACGCTACT AAAGCAAATA AACAAGCTTT TGGTTTTGAA AACAATTTTG GACAACAAGA TAATATCTCG TACAAAGATA TCGAAACTTG TAGATTTAAA TAAAGCAACA GTCTCAAACC TCACCGATGA GCTCATAAAA GAAGGGTATG TAGTAGAAAA AGGATACGGT AAGTCCAAAG GCGGAAGAAG ACCTGTCCTT TTACAGGTAA ACAAAGATGT AGGTTCAATC ATCGGAATTG ACTTAGGTGT TGACTATATT CATATTATTC TCTCAAACTT TGTGGGAGAA GTTATTTTTG AAGAATATGC CAACATGAAA ATAGGTGAGG ATAAGGAAAA ACTTTTAAGG CTTCTTTTTG ACCTGATTGA AAAATCAGTA AAAAAGGCGC CACAAACTCC AAAAGGGATT TTAGGTATTG GAATTGGTGT TCCAGGTATT ATAGAAAAAG AGTCTGGAAC TGTCCTTCTT GCTCCCAATT TGAAATGGCA AAATGTCCCT TTGAGGTCAA TTGTTCAGCA AAAGTTCAAC CTCCCTGTTT ATATTGACAA TGAAGCAAAT GCAGGCGCAC TGGGCGAAAA GTGGTTTGGT GAGTGGGGAA AAGTTAGTGA TTTGATTTAT TTGAGTGTTG GAATTGGGCT TGGTGCAGGA ATTATTATCG ACAACAAACT TTTCAGAGGT GCTGCAGGAT TTGCAGGTGA AGTTGGACAT ACCACTATCA ACTTTCAGGA CGATGTTTGC AGCTGCGGCA ATATTGGCTG TCTTGAGAAC TTTGCATCCG AGAGGGCACT TTTGAGTGTT ATAAAAAAGC TTGTAAAACA AGGAGTAGAG GATAGGTATA TAAGCTGGGA AAATGTAGAT GAAATCACTC CTTCTCGGAT AATACAAGCA GCAAAAGAAG GAAGCAGAGT TTGCAGAATG GCTATACTTG AGGTTGCTGA AAAGATGGGG ATAGGCGTGG CAAATCTTGT AAATATTTTT AATCCTGAAA TGGTAATTAT AGGTAATAAG GCATCATTTT TTGGAGAGTT GTTTTTAGAA AAATTGAGGG AAGTAATTAA CCAGAGATCC TTTATCGCCC AGTTTTATAA TCTTAAGATA GAGGTTTCAA AACTGAAAGA CAGAGCTGTG GTGCTGGGAT GTATAGCTAT GGTGATTTCC GACATGCTTT CTTTTCCAGA ATATGCATGA
|
Protein sequence | MGNHTLLKQI NKLLVLKTIL DNKIISRTKI SKLVDLNKAT VSNLTDELIK EGYVVEKGYG KSKGGRRPVL LQVNKDVGSI IGIDLGVDYI HIILSNFVGE VIFEEYANMK IGEDKEKLLR LLFDLIEKSV KKAPQTPKGI LGIGIGVPGI IEKESGTVLL APNLKWQNVP LRSIVQQKFN LPVYIDNEAN AGALGEKWFG EWGKVSDLIY LSVGIGLGAG IIIDNKLFRG AAGFAGEVGH TTINFQDDVC SCGNIGCLEN FASERALLSV IKKLVKQGVE DRYISWENVD EITPSRIIQA AKEGSRVCRM AILEVAEKMG IGVANLVNIF NPEMVIIGNK ASFFGELFLE KLREVINQRS FIAQFYNLKI EVSKLKDRAV VLGCIAMVIS DMLSFPEYA
|
| |