Gene Nmul_A0920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0920 
Symbol 
ID3786465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1045267 
End bp1046967 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content59% 
IMG OID637811002 
Producttyrosinase 
Protein accessionYP_411615 
Protein GI82702049 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCAA AATTGTTCTC ATCCACCATG TCCCGCCGCA CATTCCTGAA AGCTGCGGCG 
GCGACCACCG CCGCCATCGG CGCTTCTGCC CTGCCCTTCG GAGCCCAGGC ACAAGGGAAA
GCAAAGTATC GGCGTTTGAA CGTTCTGAAC CCTGGGGCAA AGCGCGCCAT CGAAAGCTAC
AAGAGGGCTA TCGCCAGGAT GCTCAAGCTC CCGCCGGAAG ACCCTCGCAA CTGGTATCGC
ATCGCGCTCA CTCACACGAT GGATTGCCCG CACGGCAACT GGTGGTTTCT GGTCTGGCAC
CGCGGCTATA TCGGCTGGTT CGAGCAGATT TGCCGTGAGC TCAGCGGCGA CCCAGGGTTT
GCTCTTCCCT ACTGGGATTG GACGGAGAAT ACGGACCCCG ACAGTCCCTT TCAGGCACGC
GTGCCCGCCG TCATGTTCGA GGATGTGCTC ACCCCCGCTC ATCCGGCCTA TATTGCAAAC
TCGCGCGAAT TTCAGAACCG CTTTCGCGGG GTAATCGCCA GGGCGGATTA CTGGAAGCGT
TTTTGCGGGC CGAATGGTGA ATTCGATGAT GAGACGCAGT ATGGTCAGCT CCTCGCCCGG
GGAATCCGCT CCCCCGAGGA TCTGTGGTTC GATATGCTAA ACGATCCGAG AGGCCGCTTT
TTTTTCGATC TGAAACAGGC GCGCGGCACG ACCCGGGAAA AGCCGGAGCT CGATGGAAAA
ACAACGAAGG CCGTCTCCCT GCAAACATTG CTGGACGCGC TGGCCCCTCG CGATTTCCTC
ACGTTTGCCA GCCCAAAGAC CCTTGGTCAC AGTGCCCTCA CCGGATTTGG CGTGCTGGAA
GGACAGCCGC ACAACAGGGT GCACAACTGC GTCGGCGGCA TCTTTACCGA CCCCAATGGC
AACACCACCA ACAACGGCGG CTTCATGCAG GCCAATCTAT CGCCTGTCGA CCCGCTTTTT
TTTCTGCACC ATGCGAATAT CGATCGGCTT TGGGATGTAT GGACCCGGAA GCAGTTGGCG
AGGGGATATC CTGCCTTGCC CGAAGGCGCG GATTTCGACG CCTGGTCGAG GGAACCGTTT
CTTTTCTTTG TCGATGCAAA GGGAAAGCCG GCGAAGAAAA GAACCGCCGG GGACTACGCG
GCTATCGGGG ATTTCAATTA CGATTATGAG CCCGGCTCCG GGGAGGAAGT GGTGGCGCCT
CCCATGTTCG CCTCACTGCT GGGCGCAGCG GTACCCTCCG AGAGCACCCG GGCCCAGATC
ACCCGTTCCG TGGTGAGCGG GGAGCAGGCG GCAAGCGCGG TCGTGACACT TCCGTCTCCG
CTGCTTGGCT TGCGCGCACA GGCGGAAACG CCGCGATTGT ATGCAAAGAT CACCCTGGCG
CTGCCGCCGC TGGCGCACCA TCATGATTTT GCCGTGATGG TGGATGACGG GAACAGTCGA
ACGGACCCCT CCAGTCCTCA CTACGTCGGT ACGCTCTCGA TGTTCGGTCA TCACACCATA
CAGGCTCCGG TTACCTTTAC CGTGCCTTTA TCGGGCACGA TCGAGGCAAT GCGGCAGAAC
GCGCAGCTTA CAGACAGCGG GGCGTTGAAT ATCCGGATTG TTTCGGAGCG AATGGTAAAA
CCGGGAGTAC CGATGGCAAG ACATGCCCCT GGCACGGAAC CGAAAGCGGA GGTACTTTCC
ATTGTTGTGG AGGCCCATTG A
 
Protein sequence
MQPKLFSSTM SRRTFLKAAA ATTAAIGASA LPFGAQAQGK AKYRRLNVLN PGAKRAIESY 
KRAIARMLKL PPEDPRNWYR IALTHTMDCP HGNWWFLVWH RGYIGWFEQI CRELSGDPGF
ALPYWDWTEN TDPDSPFQAR VPAVMFEDVL TPAHPAYIAN SREFQNRFRG VIARADYWKR
FCGPNGEFDD ETQYGQLLAR GIRSPEDLWF DMLNDPRGRF FFDLKQARGT TREKPELDGK
TTKAVSLQTL LDALAPRDFL TFASPKTLGH SALTGFGVLE GQPHNRVHNC VGGIFTDPNG
NTTNNGGFMQ ANLSPVDPLF FLHHANIDRL WDVWTRKQLA RGYPALPEGA DFDAWSREPF
LFFVDAKGKP AKKRTAGDYA AIGDFNYDYE PGSGEEVVAP PMFASLLGAA VPSESTRAQI
TRSVVSGEQA ASAVVTLPSP LLGLRAQAET PRLYAKITLA LPPLAHHHDF AVMVDDGNSR
TDPSSPHYVG TLSMFGHHTI QAPVTFTVPL SGTIEAMRQN AQLTDSGALN IRIVSERMVK
PGVPMARHAP GTEPKAEVLS IVVEAH