Gene Information |
Locus tag | Nther_0726 |
Symbol | |
ID | 6315697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 750390 |
End bp | 751742 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642643104 |
Product | CapA domain protein |
Protein accession | YP_001916904 |
Protein GI | 188585359 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
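The length fields above follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; both ends are ASSUMED to be
# 1-based and inclusive (the record does not state the convention).
start_bp = 750390
end_bp = 751742

# Inclusive span in base pairs.
gene_length = end_bp - start_bp + 1

# Total codons in the CDS, including the terminal stop codon.
codons = gene_length // 3

# The stop codon encodes no residue, so the protein is one shorter.
protein_length = codons - 1

print(gene_length, protein_length)  # 1353 450
```

Both values agree with the Gene Length (1353 bp) and Protein Length (450 aa) fields, which is consistent with the inclusive-coordinate assumption.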
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000469143 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTATCA AATACAAAGC TACCGACAAT TTAAGTGTGA CACTTCTTTC AAATATGTTA ATTTGTTTGT TACTACTATC AACAACTTTT TTATCAGCAT GTAACCAAAA TGAGGATCAA AACAGTAAAT CCGAGACCAT GAAAGAAGAA GAACAAAAAG AAGAAAAGGA AGCTAAAGAG GGCAAAGCCC AAGAAACAGA AACCGAACAG CAAAATTTAG AAGAAACCAA TGAAATTGTA ATTTCTGCCG TTGGAGATGT TATGGTCCAC GGTCCCCAGT TAGAAGCCCA GTACGACAGT GAGAAAAACG AATACGATTT TAATGATAAT TTTGAGTACA TAAAACCTTA TATTGACCAG GCAGACCTGG CTCTAGCTAA TTTGGAAACT GTTCTGGCGG GAGGAGAACG AGGTTATAGT GGTTATCCCA TGTTCAATAG TCCCGATGTC TTGGCAGATG CTTTGAAAAA TTCAGGTTTT AATGTTTTAT CAACGGCTAA TAACCACAGC CTTGATCAGG GAGAAGCTGG CCTAAGAAGA ACAGTAGAAG TTTTAGAAGA TAGAGGATTA AAGGCAATCG GCACAAAAAA ACAAGCAGAG GATGACAGTT ATTTGATCAA AGAAATTGAA GGAATCAAGA TAGGGTTATC AGCTTTTACA TATGAAACCC CTCGAATTGA CGGGCAACGA ACTATTAATG GCATCCCTAT GTCTGATAAG ACAGCCGAAC TAATCGATAG TTTCAATTAT GATGAATTAA ATCAAGATAT GGAAGATTTG ACAGAAAGGG CTGAACATTT ACAAGATCAG GGAGCAGATG TCATAGCTTT TTTCATGCAC TGGGGAACGG AGTACGAAAG ACAGCCCAAT GAATATCAAG AAAAAATAGC AGAAGAACTG GTGGACAGTG GAGTAGATAT AATCTTTGGG AGTCATCCCC ATGTAGTTCA GCCAGTCGAG GAAATTAAAA CACCTTCAGG AGAAGAAGGA ATTGTGATTT ATTCCATGGG AAATTTTCTT TCAAATCAAA GAAGGGAATA TCTGGACCGG CCTTATACCG AAGACGGTGT GATTATGCAT GTCACTATTG AAAAAGACGC CCATGCAGAC ATTGAAATAA CTGAAACAGC TTATACGCCT ACATGGGTAC ATAAATACCG TGAAAATGAA GAGAGGAACT ACGAGATTGT ACCTCTTCCC GATGCCTTAG AACATGAAAA AATTTACAAC TTACATACAG AAGAAAGCGT TAACAGAGCA CAACAATCAT TAGAAAATAC CAATAAGATA TTGACAACTG ATGATTCTTT TACACCCGAA CACATTCATT CACTGTTCCC TCCACAGGAC TAA
|
Protein sequence | MFIKYKATDN LSVTLLSNML ICLLLLSTTF LSACNQNEDQ NSKSETMKEE EQKEEKEAKE GKAQETETEQ QNLEETNEIV ISAVGDVMVH GPQLEAQYDS EKNEYDFNDN FEYIKPYIDQ ADLALANLET VLAGGERGYS GYPMFNSPDV LADALKNSGF NVLSTANNHS LDQGEAGLRR TVEVLEDRGL KAIGTKKQAE DDSYLIKEIE GIKIGLSAFT YETPRIDGQR TINGIPMSDK TAELIDSFNY DELNQDMEDL TERAEHLQDQ GADVIAFFMH WGTEYERQPN EYQEKIAEEL VDSGVDIIFG SHPHVVQPVE EIKTPSGEEG IVIYSMGNFL SNQRREYLDR PYTEDGVIMH VTIEKDAHAD IEITETAYTP TWVHKYRENE ERNYEIVPLP DALEHEKIYN LHTEESVNRA QQSLENTNKI LTTDDSFTPE HIHSLFPPQD
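As a consistency check between the two sequence fields, the gene sequence can be translated and compared with the protein sequence. This is a minimal sketch, not the database's own procedure. It assumes the gene sequence is given 5' to 3' on the coding strand (it starts with ATG and ends with TAA), and it uses the standard sense-codon assignments, which translation table 11 shares as far as I know. The helpers `clean` and `translate` are hypothetical names introduced here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid string
# is the standard assignment; table 11 is ASSUMED to differ from it only
# in its permitted start codons, not in these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the 10-base grouping spaces used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    # Ignore any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence from the record.
first_30 = "ATGTTTATCA AATACAAAGC TACCGACAAT"
print(translate(first_30))  # MFIKYKATDN
```

The output matches the first ten residues of the protein sequence field. Passing the full gene sequence to `translate` and comparing the result with the protein field, after applying `clean` to both, would extend the check to the whole record.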