Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1043 |
Symbol | |
ID | 6314224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1106425 |
End bp | 1107711 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642643415 |
Product | Sarcosine reductase |
Protein accession | YP_001917215 |
Protein GI | 188585670 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.317745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGA CTTTGGAAAA AGTAAGAGTT GATGATTTAG TTTTTGGTGA TCACACCATG ATTAACGGTA CCACTTTAAT CGTTAACAAA AACGAAATCA TTAAAAAAGT TAAGGAAGAT GATCGGATTG CTAAAGTAAA CATAGACATT GTGAAGCCAG GTGAATCTGC TCGCTTATTT TCTGTAAGAG ATGTAATTGA ACCTAGAGTA AAAGTGGACG ATACGTGTGA ACTGTTTCCT GGTACGATTG GCAGTGTTGA CCAGGTGGGA GAAGGGGTTA CCAAAGTGTT TAAAGGTGCA ACGATAGTAA CCACCGGGAA AATTGTTGGA GTTAAGGAAG GTATTATAGA CATGTCTGGT CCAGGAGCAG AGTACACACC TTTCTCTCAT ACGAATAACC TTGTACTAGA CTGCGATCCG ATTTCTGGCC TGGAAAGTCG CCAGTACGAA GAGGCTTTAC GGCTGGCTGG CTTGAAAATC GCCCATTACA TTGGTGAAAA ATGTCAAGAG GCGACAGCCC AGGAAAAAGT CGCCTATGAA ACGCTACCCA TAGATCAACA GAAGCAAAAG TACCCTGAGC TACCGAAGGT GGGATACATC TATATGCTTC AAAGCCAGGG ACTATTACAC GATACTTATT TCTATGGTGT GGACGCTAAA GAGATACTAC CTACTTATAT TTATCCAACA GAAGTCATGG ACGGTGCTAT TGTCAACGGA AATAGTATTA TAGCTTGCGA CAAGAACACC ACCTACCATC ATTTGAATAA TCCTATCATT GAAGATTTAT TTGAATATCA CGGTAAAGAA ATTAATTTCT GCGGGGTCAT TATTACAAAT GAAAACGTTA CTTTAGAGGA TAAAGAGCGT TCTTCAAATT ACACTGCTAA ATTGTCGGAG CAATTTGGGT TTGATGGTGT CATCATTTCA AAAGAAGAGT ACGGAAACAC AGACACCGAT TTGATTATGA ATTGCAAGAA GATTGAGGAA AAGGGGATCA AAACGGTACT TGTAACGGAC GAGTATGCAG GCCGGGATGG TTCTTCCCAA TCCCTAGCAG ACGCTGACCC GAAAGCAGAT GCTGTAGTGA CTACTGGAAA TGCCAACGAA ACGATCATAT TACCCCCTAT GGACAAAATT ATTGGTAAAA TCGATTCAGA GGATCTGGAT GCTGGTAATT ATGAAGGGAA CCTCAAGCAT GATCAGAGTA TCGAAATAGA GATACAAGCT ATTATTGGGG CAACCAATGA ATTAGGTTTC AACAAGATGG GTGCAACCGA ATTCTAA
|
Protein sequence | MKLTLEKVRV DDLVFGDHTM INGTTLIVNK NEIIKKVKED DRIAKVNIDI VKPGESARLF SVRDVIEPRV KVDDTCELFP GTIGSVDQVG EGVTKVFKGA TIVTTGKIVG VKEGIIDMSG PGAEYTPFSH TNNLVLDCDP ISGLESRQYE EALRLAGLKI AHYIGEKCQE ATAQEKVAYE TLPIDQQKQK YPELPKVGYI YMLQSQGLLH DTYFYGVDAK EILPTYIYPT EVMDGAIVNG NSIIACDKNT TYHHLNNPII EDLFEYHGKE INFCGVIITN ENVTLEDKER SSNYTAKLSE QFGFDGVIIS KEEYGNTDTD LIMNCKKIEE KGIKTVLVTD EYAGRDGSSQ SLADADPKAD AVVTTGNANE TIILPPMDKI IGKIDSEDLD AGNYEGNLKH DQSIEIEIQA IIGATNELGF NKMGATEF
|
| |