Gene Nmul_A1246 details

Gene Information

Locus tag: Nmul_A1246
Symbol:
ID: 3786022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1430388
End bp: 1433450
Gene Length: 3063 bp
Protein Length: 1020 aa
Translation table: 11
GC content: 53%
IMG OID: 637811331
Product: transcriptional regulator
Protein accession: YP_411941
Protein GI: 82702375
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
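
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1430388
END_BP = 1433450
GENE_LENGTH_BP = 3063
PROTEIN_LENGTH_AA = 1020


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3063
print(expected_protein_length(GENE_LENGTH_BP))  # 1020
```

Under those assumptions both computed values agree with the listed Gene Length and Protein Length.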


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.571861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGACAA CTTACTTGGA ATTCCATCCT TTCAGGCTGG ATAAGATCAA CGCCATCCTA 
TGGCGCAACG ATCAAGTAGT ACCGTTACGT CCGAAAAATT TCGCTATGTT GTGTTACCTG
GCGGAGCGGG CTGGCACCCT GGTGACCAAG GATGAGTTGC TTGACGCGGT ATGGCAGCGC
CGCTTCGTCG GAGAAGCAGT GCTGAAAGTA TGTATCAACG AAGTGCGGCG GGCTCTTGGA
GACAGTGTTT CCGCTCCCAC CTATCTTTTA ACCGTTCCCA AACGCGGCTA CCGTTTCATC
GCGCAGGTTA CTGAAGTCAA GTCGTCAGAG GAAGTGGAAG AAGTAATCTG CCCCGTTTTC
CCCAAAAACC AGCGCTCTGA CAAGGTCGCA TATTGGATAG ACCGCCCGTC TCCACAAGCT
CGCTTGCTGA CAATCTGGCA AAAATCACTG GTGGGTTCGC GCCAGATCGT TTTTGTTACC
GGGGAAAGTG GAATCGGCAA GAGCACGCTG ATCGAGATGT TTCTCTCCAC AATATCCAGT
CAAAGCCCGG GGGTTTTGCG CATGCGCTGC ATAGAGCGCT TTGGCCAGGG TGAAGCCCTG
CTCCCGATGA TCGAGGCAAT TGAAAAGCGT TGCAATGCGC CAGGGGGGGG AAAACTCGTT
GAACTGTTAT ATCGTCACGC ACCGGTTTGG CTTGCGCAAT TGCCGTCCGT GCTTCGCCCC
GAGGAGCGTG TGGCGCTTCA GCAGGAGATT TTTGGCGCAA GCCGGGAACG CATGGTACGG
GAAGGTTGTG AACTGCTGGA AACCTTGAGC AAAGTGCCCC TGATTCTCGT GCTTGAGGAT
CTTCATTGGA GTGATCATGC GACTCTTGAT TTTTTAAGCT TGCTTGCGCA GCGTCATGTG
CCAGCATACT TGATGGTGCT CGCCACCTAT CGCCCCATCG ATGCAAGCCA GCGGGTGCAC
CCAGTTACAG AAGTTCACCG GGATTTGCAG TTGCGAGGAA TCTGTTCTGA AGTCGCCCTT
GAGCCGTTTT CCTGCAATGA AGTCAAACAT TACCTCACTC GACGTTGCCC AGGCATAAAT
ATTCCCGATT CGATCAGTCA GGCACTTTTC ATAAGAACCG GTGGACATCC TTTGTTCATA
TCCAATCTGA TCGAATATTT AATGGAGCAA CATCAATGGT CACCGTTATC CCAGCAGATC
GGAATCGATA CAGCGCTGCC GGAAACGATC CGCCGCGTCA TCGAGCGTGA AATCGAACGG
CTCAGCCATG ATGAACAACG GGTGCTGACG GTGGCGAGTG TCGCGGGAAT GCGGTTTAGT
GGGAACCTGC TTTGCAGTGT TCTGGGTATG GAGATCGCCG AAGCAGACCG CTGCTGCAAT
GCCCTGGTTA GACGAGGCCA GATATTGGTG TCCGATGGAA TGGAGCAAAG CACGAAAGGA
GTTGTCGCGG GCTACTATGC ATTCCGCCAT GCCCTGTACC TCGAAGTCTT CTACCAACGG
CTTTCTCCCT CCGAAACGAT ACGAATGCAC CTTCGCATCG GAGAATACCT TGAAAAGGCA
TACGGCGAGC AAAACGTGGA GCATGCAGGG GAACTCGCCC TGCATTTTGA AAACGGATGG
GATTGGCTTC GCGCCATCCG CTATCTCGTG CAGGCGGCTG ACAACAGCAC CAGGCGCTTT
GCCAATCGGC AGGCACATGA CTATCTGGCG CGCGCCGTTC GAATGATAGA ACGCTTGCCT
GATGAGCAAC AAGCGAAAAC ACGCATAAGC CTCCTCAAGC AATCAGCGGC GGTAAGACGC
TCAATGGGCG ATATGGCAGG AGCAAAAACT GATCTGGAGA AAATGCTGGC AGCCGCCAAA
GCTTTGGGAG ACAGCCGGGA ACAGGCGGTG GGACTCCTCG AATTGAGCCG CGTCCTTGTT
TTGTTGAACC GTCTTGAATG TCTCGAGTTT GCGGAGCAGG CGGTCTCTGC TTCCACAGCA
CTTGAAGACA AGGTTTTTCA TTCAATCGTC AAGGGCATGT GGGGTGGCTT GAATCTGTTG
CTCCGGCCAT GGCGGGAGGA TTATTCTGCC GCTTGTCACG AGTCTATGGA TGCGGTGCGC
GCAACGGGAA ATCCCCTGGC TCTTCACTCG CGCTTGACTC AGCATATTTA CGTTGAACTG
CTTGCCTCGA ATTACCAGGC TGCTGCGACG ACAGCAGTCG AAGCTCTGGC GTTATCACGC
GTAATGGGAG ATGGCTACAT GTTCATCGCG GGCCATTATT ACTATGGATT AACGCTGCTG
CATAAGGGTG AATGGGGCAG ATTGCGCGAA ACCGCAGAGC AAAGCAGGCG CGCGTTCGAG
GGGCATGACG CTGGCTTGCT GCTTCGGTTG CATCGCCACA TCCTGCTGGG ATGGTTGCAT
GTGGTAAGTG GCGATTTTTC GGGTGCCAAA GCGTATTGCG AGGAAGCGCT GTCAGAAGGC
GTCGGTGCCT GGGCTGACTT CGTTTCGGTT CATTGTTCCG CAATCCTGGG AAAGGCCTTG
CACGGGCTAA AAGATTATGC GGGAGCCATT CGATGCTTTG ACGCTTTTTT TCAGGCGGAA
AAGAATAATG CTCTCCCGAT ATTCTCCAAC TATTTCTTCC CTGCTTGCTT GGGAATGGGA
GAAACATGGC TCGCACTGGG AAAACTGGAT AAGGCGCGTC GCTATGCGCA GCGCTTATAT
GATCTTTCCA GCGGTCCCTC TGAACGCACC TACCTTGCAC TCAGTCATCG CCTGTTTGCC
GAGATTGCGA TAGTAGAAGA GAGCTGGGAC GAAGTACATT CGCATATTAC AAAAGCACTT
GAAATCGTGG AAAATGCGGA AATTCCCCTT GCAGCCTGGA GGGTCTACGC GACTGGAGAA
AAACTGCATT ATCGGCAGGA TGACGGGAAA AGGATGGGTT ACTACCGATC AAAAAAACAG
GATGAAATCG ATCAACTCCT CAACTCTCTT CAACCGTCGG ATCCTTTAAG GAAACATTTG
CTGAATCTTG CTGGAACCCA TGAATGCGAT TCTCTTTCAC CGGTCAGATA CAGCCCGCTT
TAA
 
Protein sequence
MQTTYLEFHP FRLDKINAIL WRNDQVVPLR PKNFAMLCYL AERAGTLVTK DELLDAVWQR 
RFVGEAVLKV CINEVRRALG DSVSAPTYLL TVPKRGYRFI AQVTEVKSSE EVEEVICPVF
PKNQRSDKVA YWIDRPSPQA RLLTIWQKSL VGSRQIVFVT GESGIGKSTL IEMFLSTISS
QSPGVLRMRC IERFGQGEAL LPMIEAIEKR CNAPGGGKLV ELLYRHAPVW LAQLPSVLRP
EERVALQQEI FGASRERMVR EGCELLETLS KVPLILVLED LHWSDHATLD FLSLLAQRHV
PAYLMVLATY RPIDASQRVH PVTEVHRDLQ LRGICSEVAL EPFSCNEVKH YLTRRCPGIN
IPDSISQALF IRTGGHPLFI SNLIEYLMEQ HQWSPLSQQI GIDTALPETI RRVIEREIER
LSHDEQRVLT VASVAGMRFS GNLLCSVLGM EIAEADRCCN ALVRRGQILV SDGMEQSTKG
VVAGYYAFRH ALYLEVFYQR LSPSETIRMH LRIGEYLEKA YGEQNVEHAG ELALHFENGW
DWLRAIRYLV QAADNSTRRF ANRQAHDYLA RAVRMIERLP DEQQAKTRIS LLKQSAAVRR
SMGDMAGAKT DLEKMLAAAK ALGDSREQAV GLLELSRVLV LLNRLECLEF AEQAVSASTA
LEDKVFHSIV KGMWGGLNLL LRPWREDYSA ACHESMDAVR ATGNPLALHS RLTQHIYVEL
LASNYQAAAT TAVEALALSR VMGDGYMFIA GHYYYGLTLL HKGEWGRLRE TAEQSRRAFE
GHDAGLLLRL HRHILLGWLH VVSGDFSGAK AYCEEALSEG VGAWADFVSV HCSAILGKAL
HGLKDYAGAI RCFDAFFQAE KNNALPIFSN YFFPACLGMG ETWLALGKLD KARRYAQRLY
DLSSGPSERT YLALSHRLFA EIAIVEESWD EVHSHITKAL EIVENAEIPL AAWRVYATGE
KLHYRQDDGK RMGYYRSKKQ DEIDQLLNSL QPSDPLRKHL LNLAGTHECD SLSPVRYSPL