Gene Haur_0849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0849 
Symbol 
ID5732750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp958523 
End bp959668 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content52% 
IMG OID641277981 
ProductMazG family protein 
Protein accessionYP_001543625 
Protein GI159897378 
COG category[R] General function prediction only 
COG ID[COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain 
TIGRFAM ID[TIGR00444] MazG family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00848879 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAGCC AACTTCTTAC GCAATTGCCA CTTGATTTGA GCCAAGGCTT TCAAGTTGTG 
CCCGCTCATA AATTACTAGC CCCAATTCCG GCTCCTGCCG AGGGCGCTGA TCGGGCTTGG
TGCGAATTAC AAAATATTGC CGAATACCCA AGCTTCGTCA GCCCACTACC GTTTCAAGCC
ACTCAAGCCC TGATCATTAC TGAAATCGAT GTAAGCAAGC TAGCAACGGT TCGAGCCAGT
TTATTGCGGC GCTACCCTGC CGAACATCCT GTGCATAACT TAAATGAAAC TGGCCTGAGC
CAACAAACCC TCGCCACTGC TAGTAGTGCT CAAGCTTGGT ATCTGCCAGC ACTCAGCATC
GAAACCGATG TGGCAAGCCC AAGCACTTTA GAATGGATTA TGGCGCGTTT GGCGGGGCCA
CACGGCTGCC CATGGGATCG CAAACAAACG CATGCGAGTT TACGTGAATT TTTGCTCGAA
GAAACCCATG AAACGCTCGA AGCCCTTGAT GCCGAAGATT GGCCTAATCT CAAAGAAGAA
TTGGGCGATT TATTATTGCA AATTGTCTTT CATGCCGAGT TTGGTCGCCA AGCAGGCCGC
TTCAACCTTG ATCAGGTCTA TACAGCGATT AACAGCAAGC TCATTCGCCG CCATCCGCAT
ATTTTTGGCA CAACCGAGGT TAGCGATGCC GACGAAGTAT TACGCAACTG GGATGCGATT
AAGGCAACCG AGCATCAGGA AAAAGGCAGC CAACGTGAGA GTGCGCTCGA TGGGATTGCC
AAAACCCTGC CGCCGCTGGC AACCGCCCAA CTCATTGGCA AAAAAGCCGC CAAAGTTGGC
TTCGACTGGC CCGATGTTAG CGGAGTTTGG GCCAAAGTCC ATGAAGAAAT TGCTGAATTG
CAAGCCGCCA CTAGCCCTGA AGAACAAGCC GCTGAGTTTG GTGATGTGCT TTTTGCCCTA
ACCAATCTTG CTCGTTGGCT CAAAATTGAT TCAGAAAGCG CCTTACGCGG CACGATCACC
AAATTCCGCC GCCGTTTCGT GGCGGTGGAG CAGGCCGCCC AAGCCCAAGG CCGCCAACTT
AGTCAACTCA GCCTAAGCGA AGCCGACACG CTCTGGGAAG CCGCCAAACG AGCTGAGAAA
CAATAA
 
Protein sequence
MLSQLLTQLP LDLSQGFQVV PAHKLLAPIP APAEGADRAW CELQNIAEYP SFVSPLPFQA 
TQALIITEID VSKLATVRAS LLRRYPAEHP VHNLNETGLS QQTLATASSA QAWYLPALSI
ETDVASPSTL EWIMARLAGP HGCPWDRKQT HASLREFLLE ETHETLEALD AEDWPNLKEE
LGDLLLQIVF HAEFGRQAGR FNLDQVYTAI NSKLIRRHPH IFGTTEVSDA DEVLRNWDAI
KATEHQEKGS QRESALDGIA KTLPPLATAQ LIGKKAAKVG FDWPDVSGVW AKVHEEIAEL
QAATSPEEQA AEFGDVLFAL TNLARWLKID SESALRGTIT KFRRRFVAVE QAAQAQGRQL
SQLSLSEADT LWEAAKRAEK Q