Gene Nmul_A1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1420 
Symbol 
ID3786618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1630361 
End bp1631524 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content53% 
IMG OID637811508 
Producthypothetical protein 
Protein accessionYP_412115 
Protein GI82702549 
COG category[V] Defense mechanisms 
COG ID[COG0842] ABC-type multidrug transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGGAA AAAACAGCCA ATCAGGTCAA CGGAATTATC TCGGCTTTCA ACTACGTCTC 
ATTTCCCTGA TTCGAAAGGA AATACGCCAA CTGTTGCGAG ACAAAAGCAA TCTCGCGGTT
GGAATTGGTC TCCCCATTAT ATTGATCCTC ATTTTCGGTT ACGGCTTGTC CCTCGATGTC
AGGAATGCGC CCGTTGCAAT CGTACTGGAA GATGCCTCCC CCACTGCGGT GGACGTTGTC
TCAGGCCTGG AACTTTCTCC GTATATTTCA CCTGTAAGAG TTCCATCCAT GCAGGAGGCA
GAGCGTTTGA TGCTGGCGCG AGAAATAGAG GGGATCTTCC GGATACCAAG CGATTTTTCC
CGACGATTGG GAGAAGGCGA AGCACAAGTC CAGTTTCTGG CCTATGGAAC AGATTCCACT
CGCGCAACAA TCATTCGAAG CTACATCAAC AGTATTGTTG GCCAGTGGAC GTTGCGGCAG
GCTGACCGAA AAGCGGCTTC ATCGGTCCCG GCCGAGGGTA GAATCACGCT GGTGGAACGC
TTGTGGTTCA ACTCCGCCAA CTCCAGTACA TGGTATCTGG TGCCAGGCTT GATCGTTCTG
ATCATGACCC TCATCGGGGC CTTTCTCACC GCGATGGTGA TGGCGCGCGA GTGGGAACGC
GGTACGCTGG AAGCATTGTT CGTCTCGCCC GTGCGCCCCA CGGAAATCCT GCTGGCGAAA
ATTGTCCCCT ATTTTCTGGT TGGCATGGCG GGTCTCGGGT TATGCCTGGC CGCAGCGCAT
TTTCTGTTCG AGGTGCCCAT GCGCGGCTCC CTGCTAATCC TGCTGGGCGG CTCGATGCTC
TATCTATTGG TGGCGCTGGG TTTCGGCCTG GTCATCTCGT CCATTACCAA AAACCAGTTC
ACTGCCAGTC AAATCGCAAT TATTACCAGC TTCATGCCCG CCCTCATGCT TTCCGGCTTT
CTTTTTGACT TGCGCAACGT GCCTGTTGTC ATTCAGCTGG TTGGAAAGAT TCTACCGGCC
ACTTATTTCA TGGAATTGAT CCGGACGCTG TATCTCGCCG GCAATGTATG GCCGGTGATA
ATAGAGAATT ATGCGATTCT CACAAGCTAT GCAGTGCTGT TGCTGGGTCT TGCGCGCTTC
GTCACCCGCA AGAAACTGGA TTGA
 
Protein sequence
MRGKNSQSGQ RNYLGFQLRL ISLIRKEIRQ LLRDKSNLAV GIGLPIILIL IFGYGLSLDV 
RNAPVAIVLE DASPTAVDVV SGLELSPYIS PVRVPSMQEA ERLMLAREIE GIFRIPSDFS
RRLGEGEAQV QFLAYGTDST RATIIRSYIN SIVGQWTLRQ ADRKAASSVP AEGRITLVER
LWFNSANSST WYLVPGLIVL IMTLIGAFLT AMVMAREWER GTLEALFVSP VRPTEILLAK
IVPYFLVGMA GLGLCLAAAH FLFEVPMRGS LLILLGGSML YLLVALGFGL VISSITKNQF
TASQIAIITS FMPALMLSGF LFDLRNVPVV IQLVGKILPA TYFMELIRTL YLAGNVWPVI
IENYAILTSY AVLLLGLARF VTRKKLD