Gene Nmul_A0801 details


Gene Information

Locus tag: Nmul_A0801
Symbol:
ID: 3785845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 917121
End bp: 918647
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 51%
IMG OID: 637810887
Product: regulatory protein GntR, HTH
Protein accession: YP_411500
Protein GI: 82701934
COG category: [E] Amino acid transport and metabolism
              [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
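
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the usual 1-based, inclusive coordinate convention and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 917121
end_bp = 918647

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1527 bp) and Protein Length (508 aa) fields.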


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0129569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATTC CCATTGAACT CGACCATAAC AGTCGTCAAT CGCTGCAAGG CCAGATTTTT 
GACCAGTTAC GCCACCTTAT CCTCGGCGGT AAGCTAAAAC CCGGTACACT CATCCCGGCG
AGTCGCGTAC TTGCCGAACA ACTGGGTATT TCTCGTAATA CTGTTTTGCT GGTTTATGAT
CGTTTGATTG CCGAAGGCTA TCTTCAGGCC CGGAAGGCAA TTGGAACGTA TGTAAACCTT
GAGCTTCCTG AAACCTGCCT TTCTGCCACC CGCAGGGTGA CCCCCTCCTC TCCTGGTGAT
GAAGAATTTG TCAAACAGCC GGTAATCCCA TTTATCGGCC ATATACACGC CAACATCACT
GTTCAGCATC TCGATTTTGA TTTTTGTCCG GATCGCATCG GTCTCGACCT GTTTCCTCAT
AAGATATGGC GCCGCTTTGT AAATAAGACA TTCATGTCTG CAAACATGCA CTTCGAGGAG
TGTTGCGATC CCGGTGGGCT ATCGGAACTG CGGGAAGTCA TCGCTGCCCA CCTTGGAGTC
GCGCGGGGTA TCAGCGTTTC ATCAGATCAG ATAATTATTA CGGCCGGCTC TCAGGAGGGA
TTGAATCTTG CTGCTCGCCT TCTTGTAAAA GAGGGGACAA TGGTTGCAAC GGAAAATCCC
TGCTACCAAG GCGCCACTTC GCTTTTCGAG AGTTACTATG CAAAATTGAT TCCTGTGCCT
GTGGATGAGG CAGGGCTCGA CGTTGAACAA CTACCAAAGG AAGGAGTTGC TCTTTTATTT
GTGACTCCTT CACATCAATT TCCCCTGGGC TTCACCTTGC CCATCGAGCG CCGGTTGAAG
CTCCTGGATT GGGCGCGGCG TTGTGGGGCA TTCATCATAG AGGACGATTA CGATTCAGAT
TTTCGATATC GCGGATCGCC CCTGACCGCT CTCATGGGAC TGGATGATTA TGGTTGTGTC
ATGTATCTGG GTACTTTCTC CAAATCCATG GGTGCGGGCC TGCGCCTAGG CTACCTAGTG
GTACCCAAAG CATTGATTCC TGCTGCCCGT TCTGCAAAAG CTCTGCTTAA TAATGGGCAT
GCCTGGCTGG ATCAAGCAAC TATGGCCGAA TTTATCCGCA GCGGCGCTTA TGGCAATCAT
CTGAGACGCA TGCGAAATAT GTATGGTAAG CGCCGTGATT GTCTGGTGGA AGCACTGCGG
GAACATTTTG GGGCCATCCG CCTTAGCGGC CTTGAGAGTG GCCTGCATCT GGCATGGCAT
CTTCCTGATC ACTATCCACC TGCAGCTCAG CTCCAGACAA TTGCATTACG TCATGGAGTA
AGAATCTACT CGATCGGAGG GGGTACGGGC TATGATTACG GTGGCTGTCG CTACAGCGCC
CGTACGCTCG TGTTAGGTTT TTCTTCCCTC AACGAATACC AGATCCGGGC TGGTGTTCGG
CGAATTGCTG ATGCTTTCGC GGATATGTCA CTCAACCCCC TTTCCATGCG AGAGAAAGTG
GATAACGTAG CGACACCCTC TCTTTAG
 
Protein sequence
MAIPIELDHN SRQSLQGQIF DQLRHLILGG KLKPGTLIPA SRVLAEQLGI SRNTVLLVYD 
RLIAEGYLQA RKAIGTYVNL ELPETCLSAT RRVTPSSPGD EEFVKQPVIP FIGHIHANIT
VQHLDFDFCP DRIGLDLFPH KIWRRFVNKT FMSANMHFEE CCDPGGLSEL REVIAAHLGV
ARGISVSSDQ IIITAGSQEG LNLAARLLVK EGTMVATENP CYQGATSLFE SYYAKLIPVP
VDEAGLDVEQ LPKEGVALLF VTPSHQFPLG FTLPIERRLK LLDWARRCGA FIIEDDYDSD
FRYRGSPLTA LMGLDDYGCV MYLGTFSKSM GAGLRLGYLV VPKALIPAAR SAKALLNNGH
AWLDQATMAE FIRSGAYGNH LRRMRNMYGK RRDCLVEALR EHFGAIRLSG LESGLHLAWH
LPDHYPPAAQ LQTIALRHGV RIYSIGGGTG YDYGGCRYSA RTLVLGFSSL NEYQIRAGVR
RIADAFADMS LNPLSMREKV DNVATPSL
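
As a quick consistency check between the two sequences, the first and last blocks of the gene sequence can be translated and compared with the corresponding ends of the protein sequence. This is a minimal sketch rather than a full translation tool. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the permitted start codons), and the `translate` helper is a hypothetical name introduced only for this illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Assumption: table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence listed above.
first_block = "ATGGCAATTC CCATTGAACT CGACCATAAC"
last_block = "GATAACGTAG CGACACCCTC TCTTTAG"

print(translate(first_block))  # compare with the start of the protein sequence
print(translate(last_block))   # compare with the end of the protein sequence
```

The two outputs correspond to the first ten residues (MAIPIELDHN) and the last eight residues (DNVATPSL) of the protein sequence, with the final TAG codon read as a stop.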