Gene Nmul_A0131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0131 
Symbol 
ID3785779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp136565 
End bp137578 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content58% 
IMG OID637810201 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_410832 
Protein GI82701266 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.1484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCG GTTCTCATAC TCTCAAAAAC AACCTTATCG TGGCGCCTAT GGCGGGAGTG 
ACAGATCGCC CGTTCAGGCA ATTGTGCAAA AGCATGGGCG CCGGAATGGC TGTGTCCGAA
ATGGTGTCGA GCAATTCGCT CCTCTGGGGT TCCGAGAAGA CACGCCGCCG CGCCAATCAT
GAAGGCGAGG TGGATCCGAT CTCGGTGCAG ATCGCCGGCG CCGACCCGGC GATGATGGCG
GAAGCTGCGC GCTACAATGT CGCGCAAGGA GCCCAGATCA TCGACATCAA CATGGGTTGC
CCTGCCAAGA AGATTTGTAA TGTCATGGCA GGCTCTGCAT TGCTGCAGGA CCCGCCGCTG
GTCGGGCGGA TTCTGGATGC CGTCATAGGC GCGGTGAGGG TGCCTGTCAC CCTCAAGATT
CGCACGGGGT GGGATACCCA GCACAAGAAT GCGCTCTCCA TTGCCCGCAT TGCGGAGAAT
GCCGGCATCC AGGCGCTTGC TATCCATGGA CGTACGCGCG CCTGTGCTTA CACCGGCCAT
GCCGAATACG ATACCATCGC GGCAGTCAAG GCTGAGGTGC GGATTCCCGT TGTCGCCAAT
GGGGACATTA CAACACCGGA AAAAGCAAAA CACGTGCTTG ACTACACGGG AGCGGATGCG
GTCATGATCG GCCGCGCAGC ACAAGGCCGC CCCTGGATTT TTCGCGAGAT CGATCACTAT
CTGGCTACCG GCTCACACCT TCCGCTGCCT GAAGTAGCGG AGATTCACCG TGTACTCGTC
GCACATTTGC ACGATCTATA TAGCTTCTAT GGCGAGTATT CGGGGGTCCG CATCGCCCGC
AAGCATATTT CCTGGTATAC CAAAGGACTG GTCGGGTCAG CGGGTTTCCG TCATGCCATG
AACCAGCTGC AGTCTACGGA CCAGCAGCTG TCTGCGGTTA ACGACTTTTT CAGTGAGCTT
GCCGGCTACG GGCGGCGGTT GACCTACGTC GAGGCTGAGG AACTGGTGGC ATGA
 
Protein sequence
MKIGSHTLKN NLIVAPMAGV TDRPFRQLCK SMGAGMAVSE MVSSNSLLWG SEKTRRRANH 
EGEVDPISVQ IAGADPAMMA EAARYNVAQG AQIIDINMGC PAKKICNVMA GSALLQDPPL
VGRILDAVIG AVRVPVTLKI RTGWDTQHKN ALSIARIAEN AGIQALAIHG RTRACAYTGH
AEYDTIAAVK AEVRIPVVAN GDITTPEKAK HVLDYTGADA VMIGRAAQGR PWIFREIDHY
LATGSHLPLP EVAEIHRVLV AHLHDLYSFY GEYSGVRIAR KHISWYTKGL VGSAGFRHAM
NQLQSTDQQL SAVNDFFSEL AGYGRRLTYV EAEELVA