Gene Haur_4057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4057 
Symbol 
ID5735915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5179654 
End bp5181411 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content51% 
IMG OID641281208 
Productphosphodiesterase/alkaline phosphatase D-like 
Protein accessionYP_001546817 
Protein GI159900570 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3540] Phosphodiesterase/alkaline phosphatase D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCGCT TATTTGGGCT ATGTTGTTTA TGTTGCTTGC TAGCTGGAAG TTGGTCAGGC 
AGTTCAAGCG TTGGCCGGAC AAATCTGCTC TTTCCGCCCG CCGATTTGCC GCAAGGCGTG
GCTGTGGGCG ATGTAACTGC TACCAGCGCG GTGCTTTGGG CACGTTCCGC CAGCCTTGGC
TCGGTTAGCT TTGAATATAG TCTCAACGCC AATTTTAATC CAGTGGCTGG CTCGGCCACA
GTTAGCATCA CCGACACGAT GCAACCAGCC AAAACCAGCA TCAGCAATTT ACAACCGGCT
ACTAGTTATT TTTATCGTGC TACAACGCTC AGCAATACTT CATTTGCTGG TAGATTTCGC
ACCGCCCCAA CCAGTGGAAC CTATAGCAAT TTACGCTTTG GCGCAAGCGG CGATCAACAA
GGGGCACTCG CGCCGTTTCC AGCTTTAGCC AACGCCGATC AGCGCGATCT TGATCTGTTT
ATTCATCTTG GCGATTCAAT TTATGCTGAT ATTGGCTCGC CTGTGTTGGG CACAACCGCC
AAAACTTTAG CCGAATTTCG CTTGAAACAG ACCGAAAGTT ATAGCACGCG GCTGAATTTG
AACACGCTGG CCGATTTACG CGCGACCACC GCATGGCTAG CGACAACTGA TGATCATGAA
GTTGCCAACG ATCACGCTGG TGGGGCAGCA CCGAGCAGTG ATCCACGTTT TTTGCCGACC
AATGCCAGCT ACATCAATGA TACTGATTAT TTTGAAGCGG GCTATCAAGC TTTCGTCGAA
TATAACCCAG TTAATGCGCT GTTTTATGGC GCGACTGGCG ATCTGCGGAC TGCCAACGAG
CGTAAACTTT ACCGCTACCA ACCATATGGT AATACGGCAG CCTTTTTTGT GCTCGATGGC
CGTTCATTTC GCGACCAAAA ATTACCAGCA CCCAACAACA CGCCCTCAGA AATTGTGGCC
TTTTTGACTG CCGTATTTAG CCCAACCCGC ACCTTGCTTG GTCAAGCTCA ACTCAGCCAA
CTCGAAAACG ATTTGCTCGC CGCTGATCAA GCCAATATCA CCTGGAAATT TGTGATTGTG
CCTGAGCCAA TTCAAAACCT TGGCACAGCA GCGGCCAACG ATCGCTTTGA GGGCTATGCC
GCCGAACGCA GCCGCATTTT GAGCTTTATC AACGATCATG CAATTGAAAA TGTGGTGTTT
ATCGCCGCCG ATATTCATGG TACGGTTGTC AATAATTTAA GCTACCAAAC CGCCGCAGGC
CAGCCGCAAA TCCCCACCAA TGCCTGGGAA ATTTCGGTTG GTTCGGTCGC CACGACCTCG
CCATTTGGCA TGCGAGTAGC AGCAGGAGCC TTATCAACCG GAATTATTAG CTCAACCACT
TACAATAATT ATTTACAACT ACCCAACGAT CAACAAGACG CGGCGGCGGA AGGTTGGCTC
AACACCGTAT TAAATGTATT CGGCTACACG CCCGTGGGCT TGCAGGATGC GCCGTTCGCT
GAACGCACCA CGCTCTTGCA AGGGCGCTAT TTTGCTGGCC ACTACTTTGG CTGGACTGAA
TTTGAAATTA GCCAGCCTGA CCAAACCTTG CGCGTTTCGA CCTATGGTAT CGATACTTAT
GGAACCAGCG AACTGGCCAA CAATCCAGGC GAGGTGGTGA GCCGTATGCC AGTGATTGTG
CAGCAATTCG AGGTTGATCC CATCGTCAGC ATCTCGCCAA CGCTGATACT CAATTATTTG
CCAGCAGTAA CGAAGTAA
 
Protein sequence
MRRLFGLCCL CCLLAGSWSG SSSVGRTNLL FPPADLPQGV AVGDVTATSA VLWARSASLG 
SVSFEYSLNA NFNPVAGSAT VSITDTMQPA KTSISNLQPA TSYFYRATTL SNTSFAGRFR
TAPTSGTYSN LRFGASGDQQ GALAPFPALA NADQRDLDLF IHLGDSIYAD IGSPVLGTTA
KTLAEFRLKQ TESYSTRLNL NTLADLRATT AWLATTDDHE VANDHAGGAA PSSDPRFLPT
NASYINDTDY FEAGYQAFVE YNPVNALFYG ATGDLRTANE RKLYRYQPYG NTAAFFVLDG
RSFRDQKLPA PNNTPSEIVA FLTAVFSPTR TLLGQAQLSQ LENDLLAADQ ANITWKFVIV
PEPIQNLGTA AANDRFEGYA AERSRILSFI NDHAIENVVF IAADIHGTVV NNLSYQTAAG
QPQIPTNAWE ISVGSVATTS PFGMRVAAGA LSTGIISSTT YNNYLQLPND QQDAAAEGWL
NTVLNVFGYT PVGLQDAPFA ERTTLLQGRY FAGHYFGWTE FEISQPDQTL RVSTYGIDTY
GTSELANNPG EVVSRMPVIV QQFEVDPIVS ISPTLILNYL PAVTK