Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1654 |
Symbol | |
ID | 6316458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1731112 |
End bp | 1733784 |
Gene Length | 2673 bp |
Protein Length | 890 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642644030 |
Product | CBS domain containing protein |
Protein accession | YP_001917816 |
Protein GI | 188586271 |
COG category | [J] Translation, ribosomal structure and biogenesis [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase [COG0618] Exopolyphosphatase-related proteins [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.873652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAG TGACATCACA TCAAAATTTA GATTTCGATG GATTAGCCGC AATGTTAGCT TATAATAAAT TGAATCCTGA CACTAAAATG GTACTACCAC CAAAATTAAA TCAAAATGTC AGAGCTTTTT ACTCACTATA CAAAGATACT TTTTCATTTG TTGAAAGAAT CAATTTTGAT TTACATAAAG TAAATACCTT GACCATGGTA GATACTAGTG ATGAAAAACG CATTGGAAAA TTATCATCTA TTTTACCGCA AGTGGAACAG CTAATCATCT ACGACCATCA TGAATACAAA AATCTTTCTC AAAACCCTAC AATAGAGATT AATCATGAAG TAGGGGCAAC AAGTACAATT CTTGTTGAAG AATTAATGAA CAAAGAAATT GAATTATCTC CCTTGGAAGC AACTGTAATA GCCATGGGGA TTTATGAAGA TACAGGGAAC CTGACATTTT CTCATGTTAC TGCTAGGGAT GTGAGAGCCG TTGCCCAACT CCTAGAGTGG GGAGCAGATG TGGATATTAT CAGGGACTAT GTAACCATCA TTTTAACACA AGCTCAAAGA GGTCTATTAG ATGAACTTAT ATCTAATACC AAGTATTTAA ATATTAATGA TTATGAAATT GGACTTTCTA TAACAGAACG GGAAGAATAT TTCAGTGGAG CGGCAGCTTT AGTACATAAG TTAATGGAAA TTGAAGATGT AGACTTATTT TTTATTATAG TAAAAATGGA CAAGAAAGTA TTCGTTATTG GTAGAAGTCG AAGAGAAGAA ATTAATGTCA GCAGGATCTT GTCTCCTTTT AGTGGAAAAG GACATGATCA AGCTGCCTCT GCTACTATAA AACATGGTGA TCTACAACAT ATCGAAACTG AACTTATTGA ATCCTTAAAA TTGCATTTAC CTGTTCTATT AACTGCTAAA GCTATCATGT CAGCTCCAGT TAAAACAATT TCGTCATCTA CTACAATACA AGAAGCTGAT CATTTATTAC ATAAATACGG TCATAGTGGT TTACCAGTAG TACAAGATAA ACAGAATGAT ATTGATGATG AAGCTGGTAA AATTGTTGGA GTTATTTCTA GAAGAGATAT AGAAAAAGCT AAACATCATG GATTTGGTCA TTCTCCGGTC AAGGGGTATA TGTCTCAAAA AGTTATTTCT ATTTCACCGG ACACTTCTAT TAAGGAAATT CAACACTTAA TGGTATCTAA TGATATCGGA CGTTTGCCAG TTATTGATTC AAATGCCAAT TTGATAGGAA TAGTTACTAG AACGAATTTA TTAAAAATAC AGCACGGTCA ATTAACTGAG GAACAAGAGC GAACTAATTT ATACTTAAAA GAACTTGATG AATTTTCTGA AGATATTACT GAATTAATGG CACGTCGGTT ACCTAGCAAA ATTCTTGGAG TTTTATATTT AATTGGCCAA AAGGCAGATA AAGAAGGTTT TAAAGTTTAT GGTGTTGGCG GATTTATAAG AGATTTACTT TTAGGTAAAG ATAATTTGGA TATAGATTTA GTCATAGAAC AAGATGCCAT TAGTTTTGCA AAATTAGTCA GTAAACACTT GAATGGAGAT TTAAAAACAT TTTCTCAATT TCAAACTGCT CGTTTAACCT TGCGTTCAGG CACGAGAATA GATTTTGCCA CAGCTAGAAT AGAATATTAT GCTTTTCCTG CAGCATCACC GGAAGTTGAA GAAAGTACCA TAAAACAAGA TCTATACAGG CGAGATTTTA CCATAAATAC ATTAGCTGTA GAGTTAAATT GTAATTCCTT TGGGAAATTA TTGGATTTCT TTGGTGGAAC AAAAGATCTA AAAAAAGGAG TTATCAGGGT ATTATATAAT CTAAGTTTTG TAGAAGATCC TACAAGAATT TTTCGTGCTA TTAGGTTTGA ATCACGATTC AATTTCAATA TTGAAGAGCA AACTCTAATT TTCATAAAAA ATAGCTTGGA AACAGGTGTA CTTGATAAAC TACCTGGTGA GAGATTGTAT GAAGAACTGC GAAATATAGT TGACGAAGAT GAAGCCGTAA ACACTTTTCG CCGTATGGAT GAACTGGGGA TATTTACAAA AATATTTCCA AATTTAAATG TATCTCAAGA TAAGTTAGTA AAATTGCAAA ACATTTATGA TATTTTGAAC TGGTATGAAC GAGAAAATAA ACAAAAACAA GAAAAGTTTG TCAGTAAAGA AGCTATGGTA TTTTCATGCT TGTTGCAGGA TCAACCGTTA CCGATAATTA GCTCGATTTT AGAACGTTTA AAAGTGCCAC AAAAAATAAG GGATGTAATT ATAACAACAG TTAAAGAAAC GGAATCCCTT TCTAGTAAAT TAGAGTATAC CGAGAAAAAT AGTGAACTAG TACAAAATTT AGAAGAGATT CCTCTGGAAA CTATTCTATT TGTGTTAGCT GATAAGGAGA ATCAAAAAAT AAAGAACAAA ATATATTACT ATCTAGAGGA GTTAATTGGA GCTGGAGTTA GTATTACTGG GGAAGATCTT AAAACTCTAG GGATAAAGCC AGGTCCTGTT TATAAAACGG CTTTAGAAGA AGTTAGAAAA GCAAGATTAG ACGGACTGGT TACAACTCCT GAAGAAGAAT TAGAATATGT GCTAGATTTT TTTGAACAGA AAGGAGAAGA TATGCATGGA TAA
|
Protein sequence | MKVVTSHQNL DFDGLAAMLA YNKLNPDTKM VLPPKLNQNV RAFYSLYKDT FSFVERINFD LHKVNTLTMV DTSDEKRIGK LSSILPQVEQ LIIYDHHEYK NLSQNPTIEI NHEVGATSTI LVEELMNKEI ELSPLEATVI AMGIYEDTGN LTFSHVTARD VRAVAQLLEW GADVDIIRDY VTIILTQAQR GLLDELISNT KYLNINDYEI GLSITEREEY FSGAAALVHK LMEIEDVDLF FIIVKMDKKV FVIGRSRREE INVSRILSPF SGKGHDQAAS ATIKHGDLQH IETELIESLK LHLPVLLTAK AIMSAPVKTI SSSTTIQEAD HLLHKYGHSG LPVVQDKQND IDDEAGKIVG VISRRDIEKA KHHGFGHSPV KGYMSQKVIS ISPDTSIKEI QHLMVSNDIG RLPVIDSNAN LIGIVTRTNL LKIQHGQLTE EQERTNLYLK ELDEFSEDIT ELMARRLPSK ILGVLYLIGQ KADKEGFKVY GVGGFIRDLL LGKDNLDIDL VIEQDAISFA KLVSKHLNGD LKTFSQFQTA RLTLRSGTRI DFATARIEYY AFPAASPEVE ESTIKQDLYR RDFTINTLAV ELNCNSFGKL LDFFGGTKDL KKGVIRVLYN LSFVEDPTRI FRAIRFESRF NFNIEEQTLI FIKNSLETGV LDKLPGERLY EELRNIVDED EAVNTFRRMD ELGIFTKIFP NLNVSQDKLV KLQNIYDILN WYERENKQKQ EKFVSKEAMV FSCLLQDQPL PIISSILERL KVPQKIRDVI ITTVKETESL SSKLEYTEKN SELVQNLEEI PLETILFVLA DKENQKIKNK IYYYLEELIG AGVSITGEDL KTLGIKPGPV YKTALEEVRK ARLDGLVTTP EEELEYVLDF FEQKGEDMHG
|
| |