Gene Information |
Locus tag | Nther_2497 |
Symbol | |
ID | 6316390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 2687985 |
End bp | 2689826 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642644883 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001918648 |
Protein GI | 188587103 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
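The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes (the record does not state this) that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative, not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2687985, 2689826)
print(length)                     # 1842
print(protein_length_aa(length))  # 613
```

Under these assumptions the values reproduce the Gene Length (1842 bp) and Protein Length (613 aa) fields listed in the record.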
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.495773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTGCAA AATACATAAA AAGAGGAATC CAGGTCCTAG TAGATATGTA TCTGTTCAAC CTGGCCTTTA TCATGGCACT ATTGTTTAGA TTCGATGGTT TTGTCCCGGA AGAATACTTG ATTATGTATC AGGATAGTTT CTGGTGGATA ACGGCAGTAT TCATTCTCAC TTGTATAGTC TTTCGGCTGT ATCACAGATT GTGGTCTTAT GCCAGTATTC ACGATGTGAT AGTCCTGGGA ACAGCCATCA CCATCGGTTC CATTTCCGTT TATGCAGCTA CTGTTGCTTT AGACATGATG TTTCCCAGAA GCATTTATCT TATCTCCTTA TTTTTGAACC TGGTATTTCT CGGAGGATAT CGGCTGGGCT TTCGAGTCCT CAATATATAT AAACGGTTTG GCTTGAGTCC TCTAAAAAAG CAAAAACCCA AAAATAGAAA AAACGTACTA ATAGTAGGTG CTGGAGATGC CGGCAATATG GCTCTAAAGG AATTAAACCA ACACCAGCTA GACCTCTCAG TAAATATTGT TGGTTTTTTG GATGATGACC AGGAAAAACA GGGTTGTCGT GTGAACGGAG TCAAAGTTCT AGGTTCTACC GATGAACTTG TTAAAATCTC CAAGAAAAAA GATGTAGACG AAGTTGTAAT TGCCATGCCT TCAGCTCCTC AACAGGTGAT TCGCTATCTC ATAAAAACTT GTAGCGATTA TTCTATTAAA ACTAAGATTA TACCCGCTGT TCATGACTTA ATTTCTGGTA GGGTTTCTAT CAATCATTTA AGAGAAGTAG AAATCGAAGA CCTTTTGAAA AGGGATCCCA TCGAACTGGA TATAAATCAA ATTGCCGGTT ATTTAACTAA TAAAGTGGTC TTGGTTACAG GGGCCGGAGG CTCCATCGGT TCGGAACTCG TCAGGCAGAT TGCCAACTTT AATCCTCAAA CTATATTACT TTTAGGACAC GGTGAAAACA GTATTTTCGA AATTTATAGA GAAATGGTAG AAAAATTTCC TAAAAATAGA TTGGTACCCA TCATTGCCGA TGTTAAAGAC AGGGAAAAAA TATTTCAAAT CGCCAAAGAT TACGAGCCCG ATGTAGTATT TCACGCCGCC GCACATAAAC ACGTGCCATT AATGGAACAA AATCCAGAAG AAGCCATTAA AAACAATATC TACGGCACAA AAAACCTGGT AGATGCAGCT CATCACCATA AATCCCAGCG CTTTGTATTG GTTTCCACAG ATAAAGCCGT TAACCCCACT AGTGTCATGG GAGCTACTAA ACGAGTTGCA GAACTAATCG TAGAAAACAT GGCTCAAACA AGTGAAACCA AATATACAGC TGTCAGGTTT GGTAATGTCT TGGGAAGCAG AGGTAGTGTG ATACCTCACT TTAAGGAACA GATCAGTAAA GGTGGGCCAG TCACAATTAC CCACCCCGAA ATGACAAGAT ACTTCATGAC AATTCCTGAA GCTTCCCAGT TAGTCATAGA GGCCGGAGGA ATGAGTAAAG GCGGAGAGAT TTATGTACTA GATATGGGGC AACCGGTAAA AATTGTTGAC CTGGCAAAAG ATCTAATCAA ATTGTCCGGG CTGGAGCCGG AAAAGGATAT TAAATTAAGC TATACCGGAA TAAGGCCTGG AGAAAAACTC TACGAAGAAC TCCTGACAGA AAAGGAAAAT GTCTCCAAGA CAAAACATGA CAAAATTTAC ATCACTGACA ATACGGTTTC CGATACAAAT GAAATGAAAA AAGAACTACA TGCCATGGAG GAAATAATAG CTGCAGATTT AACTTTCTTA 
GAAAAACAGC TCCTACAAGA ATCGAAAACT GGAACTGAAT AG
|
Protein sequence | MTAKYIKRGI QVLVDMYLFN LAFIMALLFR FDGFVPEEYL IMYQDSFWWI TAVFILTCIV FRLYHRLWSY ASIHDVIVLG TAITIGSISV YAATVALDMM FPRSIYLISL FLNLVFLGGY RLGFRVLNIY KRFGLSPLKK QKPKNRKNVL IVGAGDAGNM ALKELNQHQL DLSVNIVGFL DDDQEKQGCR VNGVKVLGST DELVKISKKK DVDEVVIAMP SAPQQVIRYL IKTCSDYSIK TKIIPAVHDL ISGRVSINHL REVEIEDLLK RDPIELDINQ IAGYLTNKVV LVTGAGGSIG SELVRQIANF NPQTILLLGH GENSIFEIYR EMVEKFPKNR LVPIIADVKD REKIFQIAKD YEPDVVFHAA AHKHVPLMEQ NPEEAIKNNI YGTKNLVDAA HHHKSQRFVL VSTDKAVNPT SVMGATKRVA ELIVENMAQT SETKYTAVRF GNVLGSRGSV IPHFKEQISK GGPVTITHPE MTRYFMTIPE ASQLVIEAGG MSKGGEIYVL DMGQPVKIVD LAKDLIKLSG LEPEKDIKLS YTGIRPGEKL YEELLTEKEN VSKTKHDKIY ITDNTVSDTN EMKKELHAME EIIAADLTFL EKQLLQESKT GTE
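The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short, dependency-free sketch. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs only in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical and exist only for this example.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence in this record.
print(translate("ATGACTGCAA AATACATAAA AAGAGGAATC"))  # MTAKYIKRGI
print(translate("GGAACTGAAT AG"))                     # GTE
```

The two printed fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` would give the fraction that the record rounds to a percentage in the GC content field.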
|