Gene Nmul_A2598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2598 
Symbol 
ID3785479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2982661 
End bp2984181 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID637812687 
Producthypothetical protein 
Protein accessionYP_413277 
Protein GI82703711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTGGA TTGCCCTTTA CTTTCCCGCT TTATCGCTTG ACTGGGTTGA ACGCCGGTTT 
CCGGAAGCGC TCATTCCCGC AATCGGAGTA ACTGTCCGCA AAGGGAACCA GATCTGTATT
CAACAAGCCA ATAAGCCGGC GCAGGCGCGG GGAGTAATGG AGGGTCAGCC TCTCGCCAGT
GCTTTGGCTG TTTTCCCGGA TCTGGTGATC ATGGAACAGG ACTCGCATGA AGAAGGAAAA
GCCCTGCAGC AAGCCGTGTA TGCCGCATTA CGCTTTACAC CCAATATAGC GATCCAGAAC
AGCGGCCTGA TTGCCGAGGT CTCCGGAAGC CTGAAATTGT TTGGCGGCCT GAAAAAGCTC
TGCCAGTCGC TCAATCGGGT AGTGACTGCG CAAGGTTTGC AGCTCAGCGC AGGGATTGCG
CCCACCGCAA CGGGAGCATG GCTGCTGGCC CGTTCCGCCT CGTCGGGCAC TGTCATCAAT
GGGAAGGGTG AGGAGTTCCG GATATTGCTC GACGCCTTGC CTGTCGGTTT TCTGGAATCG
GCTCAGCCTC ATCTTGAAGT CATTCGCGGG ATCGGCTGTA AAACACTGGC CGATTTGCAG
CGATTGCCTC GCAGCGGAGT AGCGCGTCGC TTTGGTCAGA ACCTGCCGGC AGAGCTCGAT
CGCGCCTACG GTGACGCGCC CGATCCACAA AAGTGGTTCG AGGCGCCGGA AGATTTCCAG
CAAAAAATGA AAATGATGGG GCTGATTGAA AATGCCGGAT TGCTGCTGGT TCCTGCGCAA
CGAATGGTCG AGCAGATGTG CGGCTGGCTG GCTTTGCGTC ATGCGGCGGT ATCCGCCTTT
TCATTCGTGC TGCATCACGA ATATTCCCTG CGGCAACCCC ACAAATTTAC ATCCATAAAC
ATACACCTTT CCGAGCAAAG CAGCGATCCG GCGCATTTAA TGCTGTTGCT GCGCGAGCAT
CTGGAACGTA CAAAAATAGT GGCCCCAGTA TGTGAACTGG AACTGACGGC AGATGAAATA
GCGGCGGGAG CAGACGGCAA TCTGGAATTG TTTCCCACCA TGCAATCCGA GACTACTTCA
CTCAATCGCT TCATCGAGAA ATTTTCTTCC CGCCTGGGAC CGGAAGCCAT CACCGGTTTA
AAGGTGGTTT CCGATCATCG CCCTGAATAC AGCCAAAGGT TGGAACTCTC AGGGAGGGGT
GCCTTGAATC GCTTTTCGAA GCGAGGGAGA AACTCGCAAA TCATTCCGCC GGAATCGCCC
CGTCCAGCCT GGCTGATGGA AATCCCGCTG GAACTGAAGG TGCAACGTGG CCGGCCGGTG
TATGAGTCGC CACTGAAACT GCTTGCAGGG CCGGAGCGAA TCGAGGCCGG CTGGTGGAAT
GATGACGCCA TCGCGCGGGA TTACTTCATT GCGGAGAACG ACCAGGGCCA ATTGTTATGG
ATTTACCGCG AACACAATCC GGTAGAAAAA GATAAGGGAA ACAAAGACGG AAACTGGTAT
TTGCAAGGAT TGTTTGGATA G
 
Protein sequence
MLWIALYFPA LSLDWVERRF PEALIPAIGV TVRKGNQICI QQANKPAQAR GVMEGQPLAS 
ALAVFPDLVI MEQDSHEEGK ALQQAVYAAL RFTPNIAIQN SGLIAEVSGS LKLFGGLKKL
CQSLNRVVTA QGLQLSAGIA PTATGAWLLA RSASSGTVIN GKGEEFRILL DALPVGFLES
AQPHLEVIRG IGCKTLADLQ RLPRSGVARR FGQNLPAELD RAYGDAPDPQ KWFEAPEDFQ
QKMKMMGLIE NAGLLLVPAQ RMVEQMCGWL ALRHAAVSAF SFVLHHEYSL RQPHKFTSIN
IHLSEQSSDP AHLMLLLREH LERTKIVAPV CELELTADEI AAGADGNLEL FPTMQSETTS
LNRFIEKFSS RLGPEAITGL KVVSDHRPEY SQRLELSGRG ALNRFSKRGR NSQIIPPESP
RPAWLMEIPL ELKVQRGRPV YESPLKLLAG PERIEAGWWN DDAIARDYFI AENDQGQLLW
IYREHNPVEK DKGNKDGNWY LQGLFG