Gene Nmul_A0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0403 
Symbol 
ID3785396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp446083 
End bp447147 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID637810479 
Productpermease YjgP/YjgQ 
Protein accessionYP_411103 
Protein GI82701537 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0309466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATAA TTTACCGCTA CCTGGGGCTC CAGGTTTTGA TGGGTCTGGG AATCGCTACC 
GCGGTGCTTC TGCCGCTCTT CGGTTTCCTT GATTTGCTGG ATCAGCTCGA TGACGTGGGA
AAGGGCACGT ACAGCGTCAA GGATGCGTTT CTTTATACGG CTCTACTGCT GCCGCGACGG
TTTATTCAAC TCGCGCCTTT CATTGCATTG ATGGGCAATG TAACTGCATT GGGAAGGCTT
GCCGTCGGAT CGGAACTGAC GGCATTACGA GGAGCAGGGG TATCTCCGCT GGGCATCAGT
CTCGCGCCGG TTGTCGTTGG AATAATTCTT TTACTGTTTA TTACTGTACT GGATCAGTTC
GTGGCGCCGC AATTTCAGCA GAAAGCCATT TCATCCCGCG CAGCCGCGCT CGAGAAGAGC
GCCGCGCTTG GCCAACAATT AGGGATATGG ACGCGGGATG AGCGGAATAT ACTGCGAATC
GGAGAAATGC TGCATGCGAG AAGGGCGGCG AACATCGAAA TAATGCATTT TGACGACAAT
GGCTTCCTGT TACGCTATAC GTATGCCAAG TATGCTGATA TCATAAACGA GGGGTTGTGG
GAGTTAAGGG ACGTCGTCAT CAAGACATTC AACGGCAATG CCATGGAGAT CGTAAGCAGA
GAATCGGTAC CCTGGGAACC CTTCCTGAAG GAGGAGGATA TCTCGACGTT GACCAAATCG
CCGGAAAGTC TCTCACCCGC CGAGTTATTT TTGCATGTGC ATTTTCTGCG CGCCACGGGT
CAGGAATCGG GCGCTTATGA GCTGGCGTTG TGGCGCAAGG CGGGTGGTGC CCTGACGACC
ATCGCGATGC TGTTGCTTTC GATTCCCTTT GTTTTTGGAT CGGTGCGGGC AGGGCTCGGC
AACCGACTCG TGGTTGCATC GATGCTTGGA ATCAGCGTCT ATCTCTTCGA CCAGATCACT
GCCAATGCCG GCTTGTTACT GCATTTGAAT CCGGCGCTGA GCGCACTTCT TCCAGGAGGG
GTGCTGATCG CCGTAGCTTA TTTCTGGTTA CGGCGAATTT TTTAA
 
Protein sequence
MTIIYRYLGL QVLMGLGIAT AVLLPLFGFL DLLDQLDDVG KGTYSVKDAF LYTALLLPRR 
FIQLAPFIAL MGNVTALGRL AVGSELTALR GAGVSPLGIS LAPVVVGIIL LLFITVLDQF
VAPQFQQKAI SSRAAALEKS AALGQQLGIW TRDERNILRI GEMLHARRAA NIEIMHFDDN
GFLLRYTYAK YADIINEGLW ELRDVVIKTF NGNAMEIVSR ESVPWEPFLK EEDISTLTKS
PESLSPAELF LHVHFLRATG QESGAYELAL WRKAGGALTT IAMLLLSIPF VFGSVRAGLG
NRLVVASMLG ISVYLFDQIT ANAGLLLHLN PALSALLPGG VLIAVAYFWL RRIF