Gene Information |
Locus tag | Nther_2212 |
Symbol | |
ID | 6316592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 2347769 |
End bp | 2348998 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 642644600 |
Product | molybdopterin binding domain |
Protein accession | YP_001918366 |
Protein GI | 188586821 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0521] Molybdopterin biosynthesis enzymes |
TIGRFAM ID | |
|
|
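The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Nther_2212
length_bp = gene_length(2347769, 2348998)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.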
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000001586 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00707587 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATACG AGTATGTAAA AACTCGAGAA GCTGTTGGTA TGGTGATTCC CCATGATATA ACAGGAATCA AACCTGGTGA ATTCAAGGGA GCTAAATTTA AAAAGGGACA TGTCATTACC GAACAGGATA TTCCGGTATT ACTTTCTTTA GGAAAAGAAC ATATCTATTC AATGAAATTA GAAGACGGAG AGGTTCATGA AGACGAGGCC GGAACCCGCA TTGCCCAAGC TGTTAGCGGT CACGGGATTG CCTTCAAGGG TCCGGGGGAA GGGAAGGTAG ATCTATTGGC TGAAGTCCCC GGATTATTGA AAGTAAATGT ATCTTCTTTA ACTCGGATTA ATGCTTTGGA TAAAGTGATT TTGGCTTCAT TACACAATAA TACTCCGGTA AAAGAGGATG GAAAACTTGC CGGTACCAGG GTGATCCCCC TGGTAGTTTC AGAACAGGTA GTCCAAGAAG TAGAAGAGAT TTGCCGAGAA GAGGGCCCTG TGATTTCAGT AAAACCCTTT GAAAAATTGG AGATGGGTGT TTTGGTAACA GGTCAGGAAG TTTATCAGGG GCGGGTAGAA GATGCCTTCT ATCCCGTACT GAAAGAAAAA GCCAGAGAGC ATGGTTTGAC AGATCCTCGG GTTTGCTACG CCCCTGACGA TCCTTGTGAA ATCAGTACTA ATATTCAGGC CTTGATTGAT CAGGAGGTCG ATTTAGTAGT GGTTACAGGG GGAATGTCCG TTGATCCCGA CGATGTTACT CCCAAGGGGA TTCGCATGAG TGGTGCCCAA ACTATCAGTT ATGGAGCCCC CGTGCTTCCA GGTGCTATGT TTTTGATGGC TTATCATCCT GGGAATCAGG CGAAAGAGGC CAAACTAGGA GAAAACAGAC CAAAGGCAAA GGATTCATTT GATGTATCTA ATTCAGGTGA TGCAAAAGAT CCAGCTAATC CAAGTCACGA CAGGAAAGAA GTCGGAGCTG ATAAAGTTAA CTTCAGGGAG AATCCCAGGG AAAATCCAAG GGAAAATCCC AGGGAAGTTC CCAGAGAAAT TCCCGTAGTA GGCCTACCTG CCTGTGGCAT GTTCTTTCAA ACTACCGTTT TCGATTTAAT TTTTCCCAGG TTGTTGGCAG GGGAACGGGT TAGTTCAGAG GAAATCGCCG CTCTTGGTCA TGGCGGTCTC TGTCTTCAGT GTGAACAATG CAAGTATCCC AACTGTCAAT TTGGAAAGGG GAGTGTATAA
|
Protein sequence | MKYEYVKTRE AVGMVIPHDI TGIKPGEFKG AKFKKGHVIT EQDIPVLLSL GKEHIYSMKL EDGEVHEDEA GTRIAQAVSG HGIAFKGPGE GKVDLLAEVP GLLKVNVSSL TRINALDKVI LASLHNNTPV KEDGKLAGTR VIPLVVSEQV VQEVEEICRE EGPVISVKPF EKLEMGVLVT GQEVYQGRVE DAFYPVLKEK AREHGLTDPR VCYAPDDPCE ISTNIQALID QEVDLVVVTG GMSVDPDDVT PKGIRMSGAQ TISYGAPVLP GAMFLMAYHP GNQAKEAKLG ENRPKAKDSF DVSNSGDAKD PANPSHDRKE VGADKVNFRE NPRENPRENP REVPREIPVV GLPACGMFFQ TTVFDLIFPR LLAGERVSSE EIAALGHGGL CLQCEQCKYP NCQFGKGSV
|
| |
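The relationship between the gene sequence, the GC content field, and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated the record: it assumes the gene sequence is listed in coding orientation (it begins with ATG although the gene lies on the minus strand) and that the spaces are only display grouping. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard codon table is used here. The helper names are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same codon-to-amino-acid assignments, differing only in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed in the record
prefix = "ATGAAATACG AGTATGTAAA AACTCGAGAA"
print(translate(prefix))
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows the result to be compared against the Protein sequence and GC content fields.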