Gene Nmul_A1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1610 
Symbol 
ID3784842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1848630 
End bp1849571 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content60% 
IMG OID637811699 
Producthypothetical protein 
Protein accessionYP_412303 
Protein GI82702737 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCGC GTGCGCACCA TGTACTGATC GGATTCTTTA CCGTTATGAC GGTAACGGCT 
GCACTGCTGT TCACTCTGTG GCTGAGCAAG GCTCCGGGCG ACTCCGTGCA GCGCTACTAC
ACTGTGGTCT TCAATGAAGC AGTCAGGGGG CTCTCCATAG GTAGCCCCGT TCAGTACAGC
GGCATCACGG TCGGTGACGT GGTCAATCTC GCGCTGGATC CGCAGGATCC CCGCAACGTG
ATCGCGCGGG TGCGCGTGCA GGGCAGCACG CCGATCAAGG AGGATACCCA GGCACGACTC
GCACTGACAG GCATCACCGG CAATTCGGTG ATCGAATTCA GCGGCGGGTC TCCCGACAGC
CCCGACCTCG TGGCAAAGGA TGACCACAAG GACCCGGTCA TCGTGGCCAC CCCATCGCCC
ATCGCCAAGC TGCTGGAGCA CAGCGACAAC ATGATGGCCG ATGTCACCCA GCTGGTGATG
CGGGCCAAGG AGATCCTTTC TCAAGAAAAT GCCAAGCGGC TGAGCAGGAC GCTGGAGAAC
CTAGAGCAGA CTACTGCAGT GATCGCCAGC CAGAACGATA GCGTGCGGGG AATCGTGGGT
GAACTGGCCA CTGCCAGCGC ACAGGCGAAC TCCGCATTGC GGGAGGCTAC GCAACTGATG
GCGGCAACGA ATACGCTCGT GAGCGAGAAG GGCGTTCCAA CTCTTGGCAA CCTCGATCGC
GCCACAGCTT CCCTGGCGAA AGTCAGCGCG TCGGTCGATC AGTTGCTGCT GGAGAATCGG
GCCGCGTTGA GCGGGGGCAT GCAGGGCATG AACGAACTGG GACCTGCCCT CCAGGAACTA
CGCAATACCA TGTCTGCACT GGCAAGAACC GTACGTCGTC TCGATGAGAA TCCTGCTGCC
TACCTTACGG GGCGGGAAAA AATCGAGGAG CTTGAACCAT GA
 
Protein sequence
MEPRAHHVLI GFFTVMTVTA ALLFTLWLSK APGDSVQRYY TVVFNEAVRG LSIGSPVQYS 
GITVGDVVNL ALDPQDPRNV IARVRVQGST PIKEDTQARL ALTGITGNSV IEFSGGSPDS
PDLVAKDDHK DPVIVATPSP IAKLLEHSDN MMADVTQLVM RAKEILSQEN AKRLSRTLEN
LEQTTAVIAS QNDSVRGIVG ELATASAQAN SALREATQLM AATNTLVSEK GVPTLGNLDR
ATASLAKVSA SVDQLLLENR AALSGGMQGM NELGPALQEL RNTMSALART VRRLDENPAA
YLTGREKIEE LEP