Gene Nmul_A2104 details


Gene Information

Locus tag: Nmul_A2104
Symbol:
ID: 3784675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2395305
End bp: 2397989
Gene Length: 2685 bp
Protein Length: 894 aa
Translation table: 11
GC content: 54%
IMG OID: 637812192
Product: hypothetical protein
Protein accession: YP_412789
Protein GI: 82703223
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
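
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the reported values are consistent with both assumptions). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(2395305, 2397989)
print(length, protein_length(length))  # prints: 2685 894
```

Both results match the Gene Length and Protein Length fields of the record.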


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.788762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACTTA TCGCCCCCGA CTACAACACC GCCTTTACCC GCGATGTTCT CGGACGTTAC 
GACTGCAATA CCTTTGACGA AGCGCTCGTT ACCATAGATC CAAACGCCCG TGCCGCTGCC
GGATTGCCTG GCCGAGGCGA TCTTCGGCCG TTTGATTTCA TCATAATTGG CGGGGGCACA
TTTGGTGCGG CTATTGCTGA ACATCTGTGG TTCCGAAGCA CTCAGCGCAG TGAGCGTATT
CTCGTGCTTG AGGCTGGTCC CTTCCTCCTA CCGGAACACG TGCAGAATCT ACCAATGCTC
GGCCTCGATA CCGCTGGCGC CCGCAAGGAT CATTCGCTGC AAGGCGAAGT CTGGGGCCTT
GCCTGGAATT CAAGTGATCC TTTCGGTTTT CCTGGGCTTG CTTACTGCCT GGGCGGACGC
TCGATTTACT GGGGTGGGTG GTTGCCCCGA TTGCTTGACG CAGAAATGCC AAAAGACGTT
TGGCCAAAAG GGGTGCTCGA CGATTTAACG GCAAAGGTTC TGCCAGATGG TCGAAAGGGT
TATTTCCGGC AATCAAGCGA TCAGATCGGG GTCACAGAGA CGAATGATTT CATTTTTGGC
GATTTACACC GGGCGATGCG GCAACAGCTG TTCGGAGCCC TCTCCTCGAT CCCTGATGTA
TATGATCTTG CGTTGCTGCC CGATCATCCC GCAGTCTATT ATGGCAACAC TCCAGTAACA
ATCAACGATT TGGCCAAGCT GCTTGGCATC GATAAATTGC CGAGTCCGCC GCCTTCTATC
CAGAAGCTCC GCGATGAAGC AAAACTTGAA GCACCATTGG CCGTGCAGGG ACAAGTGGGG
CACGCCGGGT TTTTTCCGTT TAACAAATTC AGCGCGGTTC CGTTACTCAT GAAGGCAGCC
CGCGAGGCGT CAACCCAAAC CCCGTTCGAT GGAGTAATAA ATCGGCTTGA TGACGTAAAA
AAACGGCTAA TGGTCGTGGA ACGCTGCCAC GTCACGCAAC TCAACACTGT CAATACCGAA
GGGGGGCGAC GCGTGAGCGA GATCGTTACC GAGCGGGGAG TGCTGCCTGT TTCCCCGGAC
AGCAAAGTCA TCATTGCTCT CGGCACGATC GAGAGCGCTC GTCTGGCGCT AACTTCCTTT
GGGACGGATG GAAAAATTGG CAGGAACCTG ATAGCCCATT TACGCTCGAA TGTCGATTTT
CGAGTTCCCC GCGCGGCACT GGCAGCGCTG CCGCCGGCAG TCAAAGCCCT CGAAGCGTCG
GCGCTTTTTG TAAAAGGCCG ACATGAGTTC AAGAAGGCCG ATGGTTCCGC AGATGGTTTT
GCCCACTTCC ATTTCCAGAT TACTGCGTCA GGTCTGGGCA CCAACGGGAC CAATTCTGAA
GCTGAGCTCT TCAAGAAAAT CCCGGACATT GACCTGTTCG ATGCACACAA GAATGCGACT
GACACACACG TCGTCATTAC GATTCGCGGC ATCGGTGAGA TGGAGCCCAA GAACGACAGT
GACTTCGGTA GCTCGGTTAC GCTCGATCTT GATCCGCAAC AAAAAGACGA ATTCCAGGCA
CGGCGAGCAT TTGTGAACCT CCAGCCCAGT TCTCGCGACT ACGAACTGTG GGATGCGATG
GATAAGGCGT CGGATGAGGT CGCGAAAGTC TTTGCCAACG GTCAGAAAAT CGATGTCATC
AAGAACGGAA ATGTGATTGT GACGAATATT GATCCGAGTT CACTTGCAAC GGTTCTGCCA
TATAAATTCT CTGATCAGGC CGGTCGGGGT CGCCGCGATG GTCTGGGCAC GACACATCAT
GAAGCCGGAA CGTTGCGAAT GGGAGAGGAT CCGAACAAAT CGGTGACCGA TCCCGATTGC
CGTTTCCATA ATGTGACAAA TACTTATGTC GCTGGACCCG CACTCTTTCC TACCATGGGT
TCACCAAACC CGATGCTCAC AGGCATTGCC CTGGTCCGCC GACTAGGAGA TCACCTGATG
CCGGAGCCGC CACTCTTGGT ATCGGAACCA GGTTTCACGT ACCTATTTGA CGGAAGCGAT
GCGCAGTTCG CGAATTGGGA GATGGCTGGA GGAGGCTCGT TCTCCCGGTT TGGCCGCACG
ATAATTGCCC AGCAGAATGA AAAAGGCATG GGCTTGCTCT TCTACAAGCC AAAACCCTTT
GAGGATTTCA CACTACGCCT CGATTTTCTT CTTCCTCACC CGCGAGGCAA TGGCAACGAT
AACTCGGGTG TCTTCGTTCG GTTCCGTGAT TCGCGCCTGC CTGATCCCGC TCCCGATCCG
GTCGATCCCG CAGACAATGC GGCCTTCGTT GCGGTTCATA CTGGGTTTGA AATCCAGATC
GATGAAGAAG CACGGGGTGA TACGCGTTTC GGCGAGCCGG ATGGTTCATT TTTTGCGAGA
ACCGGGGCAA TTTATAAAAT CAAGTCATTG GGGACCGGCG AAGGGCAGCA GAATTACCAA
AACAACATCT CTCTTGCGGC ACGACAATGG CATCACTACG AAATCGAGGT AAAAAAGCAG
GATTACGTTG TTCGGCTCAA CGGTCAGGAG GTAACCCGGT TTAAGCGTAG CCCCTCCGAT
ACTGCTAGAG GGAATCCACC CAGCGTTGAT CCCAATTCAG GTTATATTGG GCTGCAAACG
CATACAGGGA ACGTAGCTTT CGCGAATGTT CGGATCAATG CATAG
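
The gene sequence is printed in space-separated blocks of 10 bases, so it must be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage like the GC content field of this record. The helper names are hypothetical, and the example input is only the first line of the sequence, so its result is not the whole-gene value.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste all lines to get the whole-gene value.
first_line = "ATGCCACTTA TCGCCCCCGA CTACAACACC GCCTTTACCC GCGATGTTCT CGGACGTTAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Because `str.split()` with no arguments splits on any whitespace, the same helper handles the full multi-line sequence unchanged.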
 
Protein sequence
MPLIAPDYNT AFTRDVLGRY DCNTFDEALV TIDPNARAAA GLPGRGDLRP FDFIIIGGGT 
FGAAIAEHLW FRSTQRSERI LVLEAGPFLL PEHVQNLPML GLDTAGARKD HSLQGEVWGL
AWNSSDPFGF PGLAYCLGGR SIYWGGWLPR LLDAEMPKDV WPKGVLDDLT AKVLPDGRKG
YFRQSSDQIG VTETNDFIFG DLHRAMRQQL FGALSSIPDV YDLALLPDHP AVYYGNTPVT
INDLAKLLGI DKLPSPPPSI QKLRDEAKLE APLAVQGQVG HAGFFPFNKF SAVPLLMKAA
REASTQTPFD GVINRLDDVK KRLMVVERCH VTQLNTVNTE GGRRVSEIVT ERGVLPVSPD
SKVIIALGTI ESARLALTSF GTDGKIGRNL IAHLRSNVDF RVPRAALAAL PPAVKALEAS
ALFVKGRHEF KKADGSADGF AHFHFQITAS GLGTNGTNSE AELFKKIPDI DLFDAHKNAT
DTHVVITIRG IGEMEPKNDS DFGSSVTLDL DPQQKDEFQA RRAFVNLQPS SRDYELWDAM
DKASDEVAKV FANGQKIDVI KNGNVIVTNI DPSSLATVLP YKFSDQAGRG RRDGLGTTHH
EAGTLRMGED PNKSVTDPDC RFHNVTNTYV AGPALFPTMG SPNPMLTGIA LVRRLGDHLM
PEPPLLVSEP GFTYLFDGSD AQFANWEMAG GGSFSRFGRT IIAQQNEKGM GLLFYKPKPF
EDFTLRLDFL LPHPRGNGND NSGVFVRFRD SRLPDPAPDP VDPADNAAFV AVHTGFEIQI
DEEARGDTRF GEPDGSFFAR TGAIYKIKSL GTGEGQQNYQ NNISLAARQW HHYEIEVKKQ
DYVVRLNGQE VTRFKRSPSD TARGNPPSVD PNSGYIGLQT HTGNVAFANV RINA
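
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal example, not the database's own procedure. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. Since this gene starts with ATG, no special handling of alternative starts is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order for each codon position
# (the layout used in NCBI genetic code listings); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGCCACTTA TCGCCCCCGA CTACAACACC"))  # MPLIAPDYNT
print(translate("CGGATCAATG CATAG"))                  # RINA
```

The two outputs match the first ten and last four residues of the protein sequence, and the final TAG codon is the stop that accounts for the difference between 895 codons and 894 residues.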