Gene Nmul_A1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1237 
Symbol 
ID3785576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1422290 
End bp1423714 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content49% 
IMG OID637811322 
Producthypothetical protein 
Protein accessionYP_411932 
Protein GI82702366 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAATA CTAAAAATTC CTGCATGATT GTGCAACCGA TTATCAGTCG GGTAACAAAA 
GAAACTTTGG TAATGAAGAA CTTCATGCAA AGAGCAGATA AGGTACGAAA TAATGAATCG
GCACCGGTCG AGGTGCGCCC GCGGAAACTT TTCTGCAAGC CACGGAAATT TCTCGTTCAT
GTTTGCAGCG TATCGCTTGC GATACTGGCG CTGATTATGG ATTCCCTGTT TCTCTCCTCG
TGGGCGCAGG CAGCAGAAGT CACGCTTCCA AAAGGTCCAA AGTATGCAAT CGACGAAACT
AAGTGGGTGA CGCTTGGAAT AGGTTTCCGT GGAACCGGGC TATGGGTGGA AAATCCTGCC
ACGGGTAATC TCAGGAGCGG CGATTTCAGC ATTGATAATG CCCGTTTTTA CTTGAATGGG
CAGATACATC AGTACCTCAA GTTCGAAGTC AATACGGAAT GTTTTTTCTG CAACAACACC
CATCCCGGGG ATAATCCGAA GATGTCGTAC AACGTACTCG ATGCGATCGG AAAATTTGAG
CTCAACCGTT ATTTCAACAT CTGGGGGGGC CGTATGCTGG TGCCGACCGA ACGGGGCGAA
CTGAGCGGTC CTTTTTTTCA ATCAACACAT GACGCCTTCA AGACACCTTT TTTTTCCCAG
GATTTCAGTA CCAAATTTGG CAGCGGTGGA GCTGGACGCT ATGGACGCGA CGATGGAGGC
ACATTCTGGG GAAGCCTTGA ACCTGGCTTC ATCAGCGGTA CTCTGGGATA TGCTGTCGGT
GTCTACCGCG GAGTCCAGTC ATCCCGTAGC GCCGGCCCCA ATCAGGGCGA TGACGTATTA
TGGGCTGGCC GTTTCACATA TAACTTCTTG AATCCAGAAA AGAATCCTGG TTACTATACC
AGCAGCACCT ATTTTGGCAA GGCCGGCGAT ATTCTCGCGC TTGCATTCGG CGTTTCATAC
CAGAAAAATG GCGCCGGCTC CTTCGCGCAT CGAAGTGATT TCCTGGGACT GGTCGGGGAC
GCCCTTTTTG AAAAAGTGCT ACCCCGGAAC ATGGGTGTAA TCACCGCAAA TGGTGAGTAC
AAGCAGTTTT ATGCCAACTA TTCACCTGCG GCGTTTCAAG ATCCAGATTG CTTTTGCATG
TTCGATGGTA AATCGTGGAC AGTCACCGGG CTTTATCTCC TGCCCATGAG GATCGGTATT
GGCCAGTTTC AACCCTATGG CCGATTCACA AGTATCCAGC CAAATAACAG CAGCAACAGG
GAAGAAATTG AAGCGGGTGT CAACTATATC ATTGACGGTT TCAATGCTCG AATTTCAGCG
TACTACCAAC ACGGGGATCT ATTCACCAAA CGGCTGAACT ATGCGCCGGA TGTGGCTGGC
GAGAAAGTTG ATGTATTCAA GCTGTCGTTT CAACTGCAAA TGTAG
 
Protein sequence
MMNTKNSCMI VQPIISRVTK ETLVMKNFMQ RADKVRNNES APVEVRPRKL FCKPRKFLVH 
VCSVSLAILA LIMDSLFLSS WAQAAEVTLP KGPKYAIDET KWVTLGIGFR GTGLWVENPA
TGNLRSGDFS IDNARFYLNG QIHQYLKFEV NTECFFCNNT HPGDNPKMSY NVLDAIGKFE
LNRYFNIWGG RMLVPTERGE LSGPFFQSTH DAFKTPFFSQ DFSTKFGSGG AGRYGRDDGG
TFWGSLEPGF ISGTLGYAVG VYRGVQSSRS AGPNQGDDVL WAGRFTYNFL NPEKNPGYYT
SSTYFGKAGD ILALAFGVSY QKNGAGSFAH RSDFLGLVGD ALFEKVLPRN MGVITANGEY
KQFYANYSPA AFQDPDCFCM FDGKSWTVTG LYLLPMRIGI GQFQPYGRFT SIQPNNSSNR
EEIEAGVNYI IDGFNARISA YYQHGDLFTK RLNYAPDVAG EKVDVFKLSF QLQM