Gene Haur_2658 details


Gene Information

Locus tag: Haur_2658
Symbol:
ID: 5734553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3412435
End bp: 3413676
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 51%
IMG OID: 641279800
Product: metallophosphoesterase
Protein accession: YP_001545424
Protein GI: 159899177
COG category: [R] General function prediction only
COG ID: [COG1408] Predicted phosphohydrolases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
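
The start and end coordinates, gene length, and protein length listed above agree with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive (which matches the listed 1242 bp) and that the last codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3412435, 3413676)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1242 bp and 413 aa, the same figures as the Gene Length and Protein Length fields.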


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00191858
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGCAA TTAATCCTTG GGCAAAACGT CTATATCAGA CTGGTCGCTG GGCCAGCAAC 
GTCGTTATTT GGCTGGTACT ATGGATCAGT TGTGTGGCGA TCATCTTTTT GATGATTCGC
TATTTCAGCG GCGCAGAATG GCTAAGCAAT CTGCAAGGCT CAGCCCATTT TGTCGCCGAA
TTTGGCATGC GCCTGTTGAT GGCTGCGCCA TTTACCCTCT GGCTTTTGTT GATGCTGCGA
CCAATTCGCA CCCGTCGCTG GGTCGTCAGC CACATACTCA AATTGACTCA ACGCTTGCGC
CGCCAGCCCA AGCCCCAACT TGAGCCAGCC ATTGATCAGC TAGATCGAGA AGTTCAACCT
ACAACCAACC CAACTATGAG CGCAAAACCG CTCAGTCGCC GTCGATTTTT GGTTGAATCA
GGACTGGTTG GTGGTGTGGT CGGTTATGCA ATGTTGATCG AGCCATATCA GATTCAGGTC
CGCGAAGTTA ATTTGCCAAT CGCCAATTTG CCCGAACGTT TTCGCGGCAT GCGCATCGCC
CAAATGAGCG ATTTGCATAT CAATGCCTAC ACCACCAGCG CCGATTTGGC CCGCGCTGTG
GCCCAAATCA ACCAGCTCAA CCCTGATATG GTGCTGCTCA CTGGCGATTT TGTCGATTGG
GATGCACGCT TTGCTGATGC CGCCACCGAG CCATTCCGCC AGCTGCGTGC ACCCGAAGGT
ATTTATTCGG TGCTTGGCAA CCACGATTAC TACAGTGGCA AGATCGATAT AATCAAACAA
GCCATCCAAC GCCACGATTT AGGTTTGTTG GTCAATCAGC ATACCATTTT GCGCCGTGGC
GCTGATCAAT TGGTCTTGGT AGGTTTTGAT GATCCACGGC ATAATCGTAG CGGCGGGCCA
CGGCTCAGCC CTGAGAGCAT CAATCCTGAA GCGGCCTTGA AGGGTACGCC GAAAAATGTT
GCCCGCCTAG CAATGGTGCA TAATCCAGTA ATTGTGCCGC ATTTTGTCGC CAATTATCAG
CTTGATGTGA TTCTATCGGG GCATACCCAT GGCGGCCAAT TCCAAGTGCC AATTCTCACC
GACCAGCTGG TGGGCAATGC TGAATATTTT GTGCGCGGCC ATTACGATTT GGGTAAATCA
CAGGTTTATG TCAACAGTGG TTTTGGTTTT ACCGGGCCGC CCTTGCGATT TCGCTCGGCT
CCAGAAATTA CATTAATTAA TTTAGTTAAT GCCAAAGCCT AG
 
Protein sequence
MAAINPWAKR LYQTGRWASN VVIWLVLWIS CVAIIFLMIR YFSGAEWLSN LQGSAHFVAE 
FGMRLLMAAP FTLWLLLMLR PIRTRRWVVS HILKLTQRLR RQPKPQLEPA IDQLDREVQP
TTNPTMSAKP LSRRRFLVES GLVGGVVGYA MLIEPYQIQV REVNLPIANL PERFRGMRIA
QMSDLHINAY TTSADLARAV AQINQLNPDM VLLTGDFVDW DARFADAATE PFRQLRAPEG
IYSVLGNHDY YSGKIDIIKQ AIQRHDLGLL VNQHTILRRG ADQLVLVGFD DPRHNRSGGP
RLSPESINPE AALKGTPKNV ARLAMVHNPV IVPHFVANYQ LDVILSGHTH GGQFQVPILT
DQLVGNAEYF VRGHYDLGKS QVYVNSGFGF TGPPLRFRSA PEITLINLVN AKA
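
The protein sequence above is the codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, assuming the standard amino acid assignments (translation table 11 shares them with the standard code and differs mainly in which codons may act as starts; this gene starts with ATG, so that difference does not matter here). The `translate` helper is hypothetical; it ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
from itertools import product

# Codon table in the conventional NCBI layout: bases ordered T, C, A, G,
# with the first base varying slowest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGGCAGCAA TTAATCCTTG GGCAAAACGT"))
```

Applied to the first three blocks of the gene sequence, the sketch yields MAAINPWAKR, which is the first block of the protein sequence; the final twelve bases (GCCAAAGCCTAG) give AKA followed by the stop codon.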