Gene Nmul_A1454 details


Gene Information

Locus tag: Nmul_A1454
Symbol:
ID: 3785545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1661588
End bp: 1662859
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 53%
IMG OID: 637811542
Product: hypothetical protein
Protein accession: YP_412149
Protein GI: 82702583
COG category:
COG ID:
TIGRFAM ID:
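
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 1661588
end_bp = 1662859
gene_length_bp = 1272
protein_length_aa = 423


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the listed gene length (1272 bp) and protein length (423 aa) are consistent with the listed coordinates.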


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.15372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGAAA CAAACGTAGC AAGCGGAAGT TCACTGGCAG TCAAACATTA TAGCGCGGCG 
CTCTTCGCCA ACACGCTCAA AGGATCCACA GCGATTGACA GTCTTGTGGG CCCGGTCGAG
CCTTCAGTAG CAATGCAGAA AATTGCCGGT CAAACAAATC CGGGCATGCC TATCGTGCGA
ATCGATAATT TAATGAAAAG TGCTGGCGAT GTCGTATCGC TCGATCTGGT CGATACGGTG
GGCGGTGAAC CATTGATGGG CGACGTCAAT CGTGAAGGAC GGGGCAGCGC GCTTTCGTTT
TCCTCGATGG AGATCAAGAT CGATCTATCC AGTAAGGTTA TCGATGCCGG TGGCAGCATG
TCGCAGCAAC GCACCAAGCA TCAGTTGCGG GAAATTGCCC TGGCGCAATT GTCAGGTTAT
TTCCCCCGTC TCGACGCTCA GGAAACCCTG GTGCATCTTG CAGGAGCACG CGGTTCGCAA
ACCGGTTCGG ACTGGACAGT ACCGCTTCAA AGCGCGCCGA ACTTCAGTTC CATCATGGTG
AATCCCGTGA AGGCCCCTAC CTATAATCGT CATTTTGTAG TGAACGGCGC CAATCTGACT
TCAGGGGGAC AGCAGTTGGG ATCCGTCGTT TCAACGGACG CCCTGCGCTT GTCGCATCTG
GATCTGCTGC GGAAGAGGCT TGATGACATG GATCAGCCAT TGCAATCCGT CAAACTGGCA
GGGGACCGAG CCGCTCAGAC TTCCAAGATG TGGGTATTTC TCGCCACACC CAATCAGTAT
TCGCTCCTTT TGACCGAAGG TTCGTTACGT GCTTTCCAGC AGAATGCCAT CAATCGGGCG
GCATATTTTG ACGAGCGCCA CCCCCTGTTT GCCGGTGAGG TTGGGATGTG GAATGGCATT
CTGGTGATCA AGAATGAGCG TGCGATCCGT TTTATGCCTG GGGAAAGCAC AAAGATAGTC
ACCGCAGCAA ACGCGACAAC TGCCACAGAA ACCGATCAGG CTGTCAATGG AGCGTTGACT
GCCGGGTACG CGATCGAGCG CGGATTATTA TTAGGCGCAC AAGCACTGGG GGTCGCTTAT
GGCAAAACCA GGGTCAGCGG AATGCAGTTC GGATGGAAGG AGCATTGGTA TAACTTTGAA
AGTAACCTGG AAGTGATGGG TGAGAAGGTT TGCGGCAAAG CGAAAACCCG TTTCTCTATC
GACGATGGAA CAGGTTTCAG GGTACCCACC GACTTTGGTG TGATTGCGGT TGACTCGGCT
GTGCCGCTTT AA
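
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any per-base statistic such as GC content is computed. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the full sequence length. `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as an excerpt, so the printed value is not expected to match the whole-gene figure in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as a short excerpt.
excerpt = "ATGGCTGAAA CAAACGTAGC AAGCGGAAGT TCACTGGCAG TCAAACATTA TAGCGCGGCG"
seq = clean_sequence(excerpt)
print(len(seq), f"{gc_content(seq):.0%}")
```

Passing the full 1272 bp sequence through the same two functions gives the whole-gene value for comparison with the GC content field.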
 
Protein sequence
MAETNVASGS SLAVKHYSAA LFANTLKGST AIDSLVGPVE PSVAMQKIAG QTNPGMPIVR 
IDNLMKSAGD VVSLDLVDTV GGEPLMGDVN REGRGSALSF SSMEIKIDLS SKVIDAGGSM
SQQRTKHQLR EIALAQLSGY FPRLDAQETL VHLAGARGSQ TGSDWTVPLQ SAPNFSSIMV
NPVKAPTYNR HFVVNGANLT SGGQQLGSVV STDALRLSHL DLLRKRLDDM DQPLQSVKLA
GDRAAQTSKM WVFLATPNQY SLLLTEGSLR AFQQNAINRA AYFDERHPLF AGEVGMWNGI
LVIKNERAIR FMPGESTKIV TAANATTATE TDQAVNGALT AGYAIERGLL LGAQALGVAY
GKTRVSGMQF GWKEHWYNFE SNLEVMGEKV CGKAKTRFSI DDGTGFRVPT DFGVIAVDSA
VPL
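
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it builds the codon table from the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in which codons may act as starts). The helper name `translate` is hypothetical, and only the first and last codons of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGCTGAAA CAAACGTAGC AAGCGGAAGT"  # start of the gene sequence
last_codons = "GTGCCGCTTT AA"                       # final line of the gene sequence

print(translate(first_codons))  # MAETNVASGS
print(translate(last_codons))   # VPL
```

The two outputs match the first ten and last three residues of the protein sequence listed above.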