Gene Haur_4793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4793 
Symbol 
ID5736637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6110182 
End bp6111711 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content53% 
IMG OID641281958 
Productphosphodiesterase 
Protein accessionYP_001547552 
Protein GI159901305 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0108516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACTTTA TCCTTCCAGC TCTGGCTTTA CTCATTGGCC TAGCTGCAGG CGCAGGCGGT 
GCCTATGTTT GGTTAACGAA AAAATCTGGT GTAGCACTGG CCCAAGCTGA AGCAGCAGCA
CAACTACAAT TAGAACAAGC CAAGACTGAG TACAAACAAT TGCAATTAAA AGCCCGCGAA
GAACAACAAC ACCTGCGGGA AGTTGGTGAA AACGAACTCC GCGAAGCCCG CTTACGCTTG
CAAAAACAAG AAGAACGGGT CAATCGCAAA GAAGAAACTC TCGAACGTAA GCTCGACCAA
ACCGAAAAGC GCGATCGCCA GTTAGGCCAG CGCGAGCGAG CGGTCGAAAA CTTACACAGC
GAAGCCGAAC GCCTACGCAC AGCTCAACAA GCTGAGCTAG AGCGGGTCGC AGGTCTCACT
AACGAACAAG CACGTGAAAT TATTCTCTCT CGTGTTGAAC AAGAAACCCG TGCCGAGGCT
GCTCGCCGCA TCCACGAAAT CGAAGCGGCA GCAAAGCTCG ATGGCGATCG GATTGCACGC
AAAATCGTTG GGTTGGCGAT CCAACGCTGT GCATCAGATT ATGTCGCCGA AATTACTGTA
AGCACGGTAC AACTGCCCAG CGAGGAACTC AAGGGTCGGA TTATCGGGCG CGAAGGGCGC
AATATTCGCG CATTCGAGCA AATTACTGGT GTCGATATTA TTGTCGATGA TACGCCCGAA
GCGGTCACAC TCTCGTGTCA CGACCCAGTA CGTCGTGAAG TAGCCCGCTT AACCCTGAAC
CGATTGTTGA AAGATGGTCG GATTCACCCC GCCCGGATTG AGGAAGTCGC TGATAAAACC
CGCCAAGACC TTGAGCAAAT AATCCGCGAA GAAGGCGAAC GCGTGGCCTA CGAAGCCAAC
GTCCAAGGCC TTCATCCAGA TCTCTTGAAG ATTCTGGGGC GGTTGAAATA TCGTACTAGC
TACGGCCAAA ATGTCTTGCA GCATTCGTTA GAATGTGCCT TTTTGGCAGG AGCGATGGCA
GCAGAATTGG GTGCCAATGT GCAGGTTGCC AAGACCGCAG CGCTGCTCCA TGATATTGGG
AAAGCCATTG ACCACGATGT ACAAGGGCCG CACGCGTTAA TCGGGGCTGA TATTGCCCGA
CGGCTTGGCC GCAACGCGGC GGTAGTTCAT GCAATTGCTG CCCATCATAA CGATGAGGAA
CCTCAAACTG TTGAGGCTTT TCTTGTTCAA GCCGCCGATG CCATCTCTGG TGGTCGCCCA
GGCGCACGCC GCGAGACCAT CGACTTGTAT ATTAAGCGGC TCGAAGCTCT TGAACATGTC
GCCACATCGT TTCCTGGTGT TCAACGGGCA TTCGCAATTC AAGCAGGCCG GGAAGTGCGG
GTTATGGTGC AACCCGATGC GCTTGATGAT CTTGAAAGTA TTCATTTGGC CCGCGAAGTG
GCCAAAAAGA TCGAGAGTAC CCTGCAATAT CCAGGTCAAA TCAAAGTTAC CGTCATCCGC
GAAACCCGTG CGGTTGATTA TGCACGGTAA
 
Protein sequence
MDFILPALAL LIGLAAGAGG AYVWLTKKSG VALAQAEAAA QLQLEQAKTE YKQLQLKARE 
EQQHLREVGE NELREARLRL QKQEERVNRK EETLERKLDQ TEKRDRQLGQ RERAVENLHS
EAERLRTAQQ AELERVAGLT NEQAREIILS RVEQETRAEA ARRIHEIEAA AKLDGDRIAR
KIVGLAIQRC ASDYVAEITV STVQLPSEEL KGRIIGREGR NIRAFEQITG VDIIVDDTPE
AVTLSCHDPV RREVARLTLN RLLKDGRIHP ARIEEVADKT RQDLEQIIRE EGERVAYEAN
VQGLHPDLLK ILGRLKYRTS YGQNVLQHSL ECAFLAGAMA AELGANVQVA KTAALLHDIG
KAIDHDVQGP HALIGADIAR RLGRNAAVVH AIAAHHNDEE PQTVEAFLVQ AADAISGGRP
GARRETIDLY IKRLEALEHV ATSFPGVQRA FAIQAGREVR VMVQPDALDD LESIHLAREV
AKKIESTLQY PGQIKVTVIR ETRAVDYAR