Gene Nmul_A2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2687 
Symbol 
ID3785049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp3087056 
End bp3088270 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content57% 
IMG OID637812777 
ProductHemY-like 
Protein accessionYP_413366 
Protein GI82703800 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG3071] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID[TIGR00540] hemY protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGGGG CGCTCTGGCT TCTGGCCCTG TTCATGATTG CCGTGGCGGT TACCATCGCC 
GCCACTTATA ACAGCGGCTA CGTGCTGATT GTCGCCCAGC CCTATCGTAT AGAGCTGTCG
CTGAATCTGC TGGTGCTATT GCTGCTGGCC ATCATTTTGA TGGGTTACCT GGGAATGCGA
CTTATTGCAT TTACCGCCCG GCTGCCTGCC GAGCTGAGTG AATTTCGCAC TCGCAGGCGT
CGGGAAAAGG CTCTGGAAGG AACGCTGGAG GGTCTCAAGG CGTTTTTCGA GCGGCGTTAT
GCGAAGGCGG AGAAATCCGC CGCCACCGTC CTGAAAATGG AGGATTCCAC CGCTTTCAGC
GCCATCAATG CCATCGTTGC CGCGCGTGCA GCCCATGGAT TGCGAAATTA TTCCCGCCGG
GACGAGTTCA TTGCACAGGC CGAAACCAGC GCGCCGCAAG AAGTGGCATT GCGGCTGATG
ACACAGGCTG AATTGCTTCT GGACGAACAT CGACCTGAAG AAGCGCTCCG GCTGCTGCAC
CACCTGCCTC CCGGCGAGTT GCGCCGACAT CCGGGTGCCC TGAAGCTGGA GCTGGAAGCC
CAGCAGAACG TTGGAAACTG GAATGCGGTG CTTGAATTGC TCGGCCAGCT GGAACAGCAC
GATGGTCCTG AGGCAAGCCT CGTAAAACAA CTGAGAGGCA GAGCGCATAT AGAGAATCTC
AGAAGCAGAA TGTTGAATCC GCAGGCACTG AAGGAGTATT GGGAGAGCCT GTCTCCGTCG
GAAAAAAAGG ATGGCAAAGT TGCTGCTGCG GCCGCACGCG CGTTTTCTGC AACAGGAGAT
TGCGCCATGG TGCATCATAT AGTCGAGCAG AGCCTGGAGA CCCAATGGGA TTCGGAATTG
GCCAGGCTCT ATGCGGAATG CGCCGGCAGC GATCCCTTGC GGCAGATAGA ACGTGCCGAG
GCATGGCTTG AAAGGCATTC CAGTGATGCA TCCCTGCTGC TAGCTCTGGG AAAGCTCTGC
GTCAATGGGG AACTGTGGGG CAAGGCTCAG AGCTATCTTG AAGCCAGTTT ATCGGTCAAA
CCGGGATATG CGGCGCACCT CGCGTTGGGA CAGCTGAATG AGAAGCTCGG GCAGCCCGAA
CTGGCAAGGG AGCACTACGG CAAAGGACTG GAACTGGCTG TAAGGCAGCT GGAAACAGCC
GCAATGGCCG AATAA
 
Protein sequence
MKGALWLLAL FMIAVAVTIA ATYNSGYVLI VAQPYRIELS LNLLVLLLLA IILMGYLGMR 
LIAFTARLPA ELSEFRTRRR REKALEGTLE GLKAFFERRY AKAEKSAATV LKMEDSTAFS
AINAIVAARA AHGLRNYSRR DEFIAQAETS APQEVALRLM TQAELLLDEH RPEEALRLLH
HLPPGELRRH PGALKLELEA QQNVGNWNAV LELLGQLEQH DGPEASLVKQ LRGRAHIENL
RSRMLNPQAL KEYWESLSPS EKKDGKVAAA AARAFSATGD CAMVHHIVEQ SLETQWDSEL
ARLYAECAGS DPLRQIERAE AWLERHSSDA SLLLALGKLC VNGELWGKAQ SYLEASLSVK
PGYAAHLALG QLNEKLGQPE LAREHYGKGL ELAVRQLETA AMAE