Gene Nmul_A1956 details


Gene Information

Locus tag: Nmul_A1956
Symbol:
ID: 3785134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2247040
End bp: 2248287
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 52%
IMG OID: 637812044
Product: hypothetical protein
Protein accession: YP_412643
Protein GI: 82703077
COG category:
COG ID:
TIGRFAM ID:
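
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes that the start and end coordinates are inclusive and that the gene length counts one stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2247040, 2248287)
print(length, protein_length(length))  # prints: 1248 415
```

Under these assumptions the computed values match the listed Gene Length (1248 bp) and Protein Length (415 aa).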


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCTACA CGTCAAGTAA CCCCATTCCC CGCTGTTTGA CGCGAGCGAT TATCCGGCTC 
GTCGTGGTGG GTGCTGCCCT TTCGAATTGT ATTGCTTACG CGCAAAAACT GCCGATTCCC
CAGAACCTGC CGAACAACCT GCAAATACAT GGTTTTGTTA GCCAGAGTTG GCTCAAGAGC
ACGGACAACA ATAACGTTTT CGGGAAAAGC AGTTCCGATA GTGGAAGTTT TGACTTCAGG
GAATTGGGAC TGAATGCTTC CATGAGGCCC AAGCCCAACC TCCAGTTTTC GGCTCAGATG
ATTTCCCGCA CTGCGGGAAA AGGAAGTCCG GGCAATATCC GGTTCGATTA CGGATTCATC
GATTATGCGT TTCTATCGGA AGAAAACAGC AAGATAGGAA TACGCCTGGG CAGGATGAAG
AATCCGCTCG GCTTTTACAA CGACACACGC GACGTTCCCT TCACGCGGCC CAGCATACTT
TTACCTCAAT CCATCTACTT CGATCGCACG CGCAAATTGG GGATTGCAGC AGATGGGGTA
CATTTATACG GTGAATACCG TTTCGAGCAC GGCGTCCTGT CTTTTCAGGG TGGACCGGTA
CGTCCACTAG TCAGAGGTGC TGAAGCAGAA GTAGCGCTGT TGGGCCAGGG GATGCCAGGC
CATCTTGCTC CAGACATCTC CTATATTGGC CGTATAAGCT ACGAACTCGA CGAGGGCCGG
CTTCGTTTTG CAGTCAGCGG AACGAATTTG AACATAGATT ATGACCCCGC AAGCGGGGAC
CGGCTTGGCG CGGGCTCAAT TCGCTTTACA CCTCTCATTT TCTCCGCGCA ATATAACGCA
GAACGCTGGA GTTTCACTTC GGAATACGCG ATACGCCATT TCGAATATAA AAATTTTGGC
AGGGCGGCCC TCAATCTGGA TTTCTTTGGC GAAAGCTATT ACCTTCAGGG AGCTTATCGA
ATCACGCCGG AATTGGAAGC GATCGCTCGC TACGACGTAC TGTATACGGA CAGTAATGAC
CGCAGCGGAA AAAAATGGGC AGCGGCTACA GGCGGCGATC CGCATCGGCG GTTTGCTAAA
GACATTACCG TAGGGTTGCG CTGGAACGTC ACGCCTGAGT TCATGTTGCG CGCCGAGTAT
CACCGCGTGA ACGGAACAGG CTGGCTTTCG ACTCTTGACA ATCCCAATTC GGGAGACCTT
TCACCGCATT GGAATCTGTT TTCCATTCTC GGCTCCTACC GGTTCTAG
 
Protein sequence
MFYTSSNPIP RCLTRAIIRL VVVGAALSNC IAYAQKLPIP QNLPNNLQIH GFVSQSWLKS 
TDNNNVFGKS SSDSGSFDFR ELGLNASMRP KPNLQFSAQM ISRTAGKGSP GNIRFDYGFI
DYAFLSEENS KIGIRLGRMK NPLGFYNDTR DVPFTRPSIL LPQSIYFDRT RKLGIAADGV
HLYGEYRFEH GVLSFQGGPV RPLVRGAEAE VALLGQGMPG HLAPDISYIG RISYELDEGR
LRFAVSGTNL NIDYDPASGD RLGAGSIRFT PLIFSAQYNA ERWSFTSEYA IRHFEYKNFG
RAALNLDFFG ESYYLQGAYR ITPELEAIAR YDVLYTDSND RSGKKWAAAT GGDPHRRFAK
DITVGLRWNV TPEFMLRAEY HRVNGTGWLS TLDNPNSGDL SPHWNLFSIL GSYRF
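
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of removing that layout before computing a length or GC fraction. The helper names are hypothetical, and the string in the usage lines is only the first two blocks of the gene sequence, not the full record.

```python
def strip_layout(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two ten-base blocks of the gene sequence, used as a small example.
fragment = strip_layout("ATGTTCTACA CGTCAAGTAA \n")
print(len(fragment), gc_fraction(fragment))
```

Applying the same two helpers to the full gene sequence would give values comparable to the Gene Length and GC content fields.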