Gene Nmul_A2432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2432 
Symbol 
ID3784127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2770980 
End bp2773799 
Gene Length2820 bp 
Protein Length939 aa 
Translation table11 
GC content48% 
IMG OID637812522 
Productputative signal peptide protein 
Protein accessionYP_413113 
Protein GI82703547 
COG category[S] Function unknown 
COG ID[COG3868] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0164247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATGA CGGTCATCAA TGGGATAACG CCAATGATGA AGTACCAGTC GACTTTCGGG 
AATCATTATG CTTTTTTAAG GCTCATCTCT AGCATGCTTT TATCCTCTAT CTGCTTGCCC
GCATACGCGA GTACCGCGAT TTTCTACGGC CGACCTGCCC CGGTGGATCT ACTTTCACAT
TTTACGCAGG TCATAGTAGA ACCCGAGAAC ATGGATAATG TGGATTATCT TTTTGACAAG
GGAACTAAGG TATTTGCCTA CGTCAGCGTC GGAGAAGTCC ATGCGACACG AAGCTGGTAT
TCAGAGATCC CGCAAAGCTG GTTTATTGGC AGTAATAAAG AATGGGGAAG CAGTATCCTC
GACCTGACTC AGCAAGGATG GCATGACTAT CTTCTTGACA AGTATTTGTC CCGGTTATGG
GAACAAGGTT ATCGTGGATT TTTTTTGGAT GGACTGGAGA CATACCAGAG GGTTGCGGTT
GAACCTGCTG CCCGGTTAAA CCAGGAAAAA GCATTATCCA ATCTGATTAA AAGTATACAT
GAGCGTTTCC CGGGAGTTGA ACTTATACTT AACCGGGGTT TCGATATCTT GCCGCATGTA
GGCCAGTATG CGGTTGCATT AGCCGCCGAA TCGCTATTCC AACGCTGGGA TCCGATCGAT
TTGGAATATA CGGAAGTTGC TGAGCCGGAT CGTAACTGGC TCTTACAGAA GTTATGGCAG
GCACGAGATC GGTATGAACT GCAAATAATC GTTATTGATT ACGTTGATCC AAAGCAGAAG
GATTTAGCGC GAGCTACCGC AAAAAAAATA TCTGATCTGG GCTTTACGCC CTGGGTGGCA
AACCCCGCAT TGGATATTTT CGGCATTGGC AAGGTGGAGA TATTCCCTAG ACGCATCTTG
GCGCTCTATG ACGGGCACGA ATATCCGGAA GGGCTGCAGC AGACGGATAT CCATAAATTG
CTAGCCATGC CACTGGAACA TCTGGGTTAT ACACTCGATT ATCTGGACGT GAATGCTGGA
TTACCCGGCA ATTTATTGAC AGGGCAATAT GCCGGAATCG TGAGCTGGTT CAACAACGAT
GCTTTATTGC ACCCGGCGGT TTATAGAGAC TGGCTGTTGC GGCAGATGGA GGCTGGGGTA
AAAGTTGTTA TTCTCGGAAA TTTAGGTTTT AAAGCTGATA ATGCTTTTTT AAAGCATCTA
GGGGTGGAAC TAAGTCAGTC GGAATCCCTG GAGGGCTCAC TTATGGTCAG GGGGAGCGAT
AAAATTATTG GTTATGAGGC GAAACCTCAA CCAGTGACAC GGGGGTTCAC TGCCTGGCAA
GTATTGGAAG ATGGTGTTCA AAAGCATCTA AGTTTTACAG GTCAATCGGA AGAGCCGGTG
GTTGCAGTAT TTTCCGGGAA TTGGGGCGGG ACCGCCTTGC ATCCTTATGT GATAGAAGCT
GGTTACCAGG GGCTGAAACG ATGGATCATC AATCCATTTG AGTTTTTAAG CACTGCTCTT
GATGTAAAAG GAATACCCGT ACCCGATGTC ACGACCGAGA ATGGCCGGCG TTTATTGCTG
GTACAGATTG ATGGTGATGG GGCTGGGAAT AAAGCAGAAA TACCTGGAAC GCCGTTGGCG
ATAAAAGTCA TCCAGGATCA ATTCTTGCAA AAATATAACC TGCCTTCTAC GGTTTCCATA
ATTGAAGGAG AAACGGCGAA CCGCGGCTCC TCAGACAAGG TAACTGAGGC CGAGAAAATA
GCACAAGATA TTTTCAAGTT GAACCATGTT GAAATAGCAA GTCATTCCTA TAGTCATCCT
TCCGCATGGT TTCCAAAGAA CACTATTGAT GCTGATGGAG ATGAGTATCA GCTTTCTGTC
GATGGATATG GATTTGACTT GCATCGGGAA ATCGCCGGAT CCGTGGAATA TATTAATAAC
ACCTTGGCGC CTGAGAACAA GCGGGCGCGT GTTTTCTTAT GGACCGGGGA CGGGCTGGCA
AGCAGAGAGG CTCTAGCTTT AGCGCGTTCA CTTGGGTTGG AGAACATGAA TGGCGGCGGC
GCGACGATTT CCAATGATGA AAAAACCGTG ACACGTGTGC CCCCCCTGGG TTATGTAATG
GATGACCAGC TCCAGTTATA TGCCCCCATG GCGAGCGAGC ATGTTTATAC AAACAGATGG
CAAGGTCCCT TTCACGCATT TAGTCGCGTC GTCGAAACCA TGCAACTCAC TGACCGCCCC
CTTCGTTTGA AACCATTGCA TATTCATTAT CACTTTTATT CAGGAAGTAA AACGGATTCC
ATCAATGCGC TGAAATCCGT GTATGAGTGG ACTGTTAAAC AGGAACATCG ACCTGTCTGG
GTCAGCGAGT ACGGTCAGAA GGTAAATGAA TTCCGCAATC TTACAATGTC TCGCCGTGTC
GATGGTGCCT GGGATATTAG AGGGCTGAAT ACACTACGCA CGTTACGTCT GCCAGTGTCG
ACAGGGTGGC CCGATTTGGA AAGATCGGAG GGGATTGTAG GGGTCCGTGA TGCGAGTCAA
GGGCGATATG TGCATCTTTT GCCTAATCAT GGTCAAGTGC TGCTGTATAT GACACCCGAG
CTTCCCTTAT CTCCTTATTT ATCCCACAGC AACGGGGAGA TTGACGAATG GCAGAAAATG
TCAGGGGGCG TTAATTTTCG AATGCGTGCG CATACGCCAT TAGAAATGAC GGTCGCATCT
GAAGGAGAGT GCCATATCAA TTGGTCCGGT GGCACTCTGG AAGGGCAGCG CGAGGGTCAA
GGCTGGAAAT TTGTTTTTCC AGTGGCCGAT TCCGGGGATG CGACCCTCGT GTGTTCTTGA
 
Protein sequence
MPMTVINGIT PMMKYQSTFG NHYAFLRLIS SMLLSSICLP AYASTAIFYG RPAPVDLLSH 
FTQVIVEPEN MDNVDYLFDK GTKVFAYVSV GEVHATRSWY SEIPQSWFIG SNKEWGSSIL
DLTQQGWHDY LLDKYLSRLW EQGYRGFFLD GLETYQRVAV EPAARLNQEK ALSNLIKSIH
ERFPGVELIL NRGFDILPHV GQYAVALAAE SLFQRWDPID LEYTEVAEPD RNWLLQKLWQ
ARDRYELQII VIDYVDPKQK DLARATAKKI SDLGFTPWVA NPALDIFGIG KVEIFPRRIL
ALYDGHEYPE GLQQTDIHKL LAMPLEHLGY TLDYLDVNAG LPGNLLTGQY AGIVSWFNND
ALLHPAVYRD WLLRQMEAGV KVVILGNLGF KADNAFLKHL GVELSQSESL EGSLMVRGSD
KIIGYEAKPQ PVTRGFTAWQ VLEDGVQKHL SFTGQSEEPV VAVFSGNWGG TALHPYVIEA
GYQGLKRWII NPFEFLSTAL DVKGIPVPDV TTENGRRLLL VQIDGDGAGN KAEIPGTPLA
IKVIQDQFLQ KYNLPSTVSI IEGETANRGS SDKVTEAEKI AQDIFKLNHV EIASHSYSHP
SAWFPKNTID ADGDEYQLSV DGYGFDLHRE IAGSVEYINN TLAPENKRAR VFLWTGDGLA
SREALALARS LGLENMNGGG ATISNDEKTV TRVPPLGYVM DDQLQLYAPM ASEHVYTNRW
QGPFHAFSRV VETMQLTDRP LRLKPLHIHY HFYSGSKTDS INALKSVYEW TVKQEHRPVW
VSEYGQKVNE FRNLTMSRRV DGAWDIRGLN TLRTLRLPVS TGWPDLERSE GIVGVRDASQ
GRYVHLLPNH GQVLLYMTPE LPLSPYLSHS NGEIDEWQKM SGGVNFRMRA HTPLEMTVAS
EGECHINWSG GTLEGQREGQ GWKFVFPVAD SGDATLVCS