Gene NmulC_2789 details


Gene Information

Locus tag: NmulC_2789
Symbol:
ID: 3786789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007616
Strand:
Start bp: 9223
End bp: 10275
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 50%
IMG OID: 637812898
Product: hypothetical protein
Protein accession: YP_413485
Protein GI: 82703921
COG category: [L] Replication, recombination and repair
COG ID: [COG5534] Plasmid replication initiator protein
TIGRFAM ID:
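
The coordinates and lengths listed above agree with each other. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(9223, 10275)
print(length, protein_length_aa(length))  # 1053 350
```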


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGTCA ACGAAACCAC GAATATTAAA AAGAATGAGC GATCGGTCTT GTTGCCGGAA 
CGCCATCCCA ATCACGAGCT GTTCATCTGC GATGTGCTTG AGGCCATCCC CAAGGATGAC
TTGGCAAGCA TGGAACACCC CGTATTCTCC CTGGCAACCA AACCAGATAC ACGGACCCTG
ATTTATGAGC ACAGGGATGT AAAAATCCAG ATCACGCCAA GCGTTAAGGG GTTGGCGACG
ATTTTCGATA AGGATTTATT GATTTTCTGC ATCTCCCAAA TGATCGCAAA AAAGAATAGG
GGAGAGCCTC TATCGCAAAA TGTACGCCTC CATGCATACG ATCTTTTGAT ATGGACGAAC
CGGGAAACAA GCGGCGATGC TTACCGGCGC CTCATAGAAG CATTTGAGCG GCTACGCGGG
ACCACAATCG TGACAAACAT CAAAGCAGAC GGTGAGGAGA TAACCACAGG TTTTGGCCTT
ATCGATAGTT TCAAGGTTGT TCGTCATACC GCCACCGGGC GCATGAGTGA GCTGGAAGTC
CGGATCTCGG ACTGGATGTT TAAAATCATT CAGGGTTCAC AGGTGCTGAC GCTGAGCCGG
GATTATTTCC GGCTTAGAAA ACCCATCGAA CGGCGGATTT ACGAGATAGC ACGTAAGCAT
TGCGGGGAGC AGGACGAATG GCGGATTTCT ATCGAACTGC TCCAGAAAAA AACTGGAGCC
AGCAGTCACG AACGGGCGTT TAAAGCCATG GTGCGGGAGC TGGTCAAATG CGACCATTTG
CCCGACTACA GCGTCACATT GGAAGACGAT ATGGTAATTT TTTATAACCG GGCGGGCTTA
TCGGAGAAAA TTCCTCTTAC CGCGTTTCCT CAGCTCAATG CTGAAACTTA CAACGATGCT
CGTACCGTGG CCCCAGGTTA TGACGTCTAT TATCTCGAAC AAGAATGGCG GGACATGTGG
GTTGATACCG GAATGCCGCT ACTCCACAAT CCCGACAAAG CTTTTATAGC TTTTTGCAAA
TCCCGGGCAA AACGTCGCCC AATGGGTCGG TAA
 
Protein sequence
MSVNETTNIK KNERSVLLPE RHPNHELFIC DVLEAIPKDD LASMEHPVFS LATKPDTRTL 
IYEHRDVKIQ ITPSVKGLAT IFDKDLLIFC ISQMIAKKNR GEPLSQNVRL HAYDLLIWTN
RETSGDAYRR LIEAFERLRG TTIVTNIKAD GEEITTGFGL IDSFKVVRHT ATGRMSELEV
RISDWMFKII QGSQVLTLSR DYFRLRKPIE RRIYEIARKH CGEQDEWRIS IELLQKKTGA
SSHERAFKAM VRELVKCDHL PDYSVTLEDD MVIFYNRAGL SEKIPLTAFP QLNAETYNDA
RTVAPGYDVY YLEQEWRDMW VDTGMPLLHN PDKAFIAFCK SRAKRRPMGR
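
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do this, not the database's own pipeline. It assumes the gene sequence is given in the coding orientation, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in the set of permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Codons containing characters other than T, C, A, G become 'X'.
    Alternative start codons are not remapped to M; the gene above
    starts with ATG, so that case does not arise here.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGTCTGTCA ACGAAACCAC GAATATTAAA AAGAATGAGC GATCGGTCTT GTTGCCGGAA"
last_line = "TCCCGGGCAA AACGTCGCCC AATGGGTCGG TAA"
print(translate(first_line))  # MSVNETTNIKKNERSVLLPE
print(translate(last_line))   # SRAKRRPMGR
```

The two printed fragments match the first 20 and last 10 residues of the protein sequence listed above.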