Gene Haur_4617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4617 
Symbol 
ID5736464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5902810 
End bp5904318 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content50% 
IMG OID641281781 
ProductPpx/GppA phosphatase 
Protein accessionYP_001547376 
Protein GI159901129 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAC ACGTTGGAAT TATTGACCTT GGCTCCAACA CCGCCCGCAT GATCGTGGTG 
CAATACCAGC CCTATTACTC CTTCAAACTG GTCGAAGAAG TTAAAGAAAA TGTGCGCTTG
GCACATAATG TTGGCGCTGA TAATCAATTG CAAGCTGAGC CGATGGCCAT GGCAATTGAA
ACCTTGCGCA TGTTTAGCAA TTTTTGCCGT GCCTTGGGGG TTAATGAAGT TGTGGCTGTG
GCCACCAGCG CCGTGCGCGA TGCCGCCAAT CAAGCCAGCT TTTTGGCCCA AGTTAAAGAA
GAAACTGGCC TCGATTTACG CGTACTCAGC GGCGATGAAG AAGCCTACTA CAGCTACCTC
GGGGTGATTA ATACCCTTGG GGTCAGCAAT GGTTTTATGT TTGATATTGG CGGTGGCAGC
GTCGAATTGG CCTTAGTTCG AGGCCGGGGC TTGGCGCATA CCACTTCATT ACCACTTGGT
ACAGTGCGGC TCACCGAGCA AATTTTGCGC AGCGAAACCC CCAGCAAAGC CGAACTCAAA
GCCCTCGATC GCCATTTAGA TGAGGCGTTA GCCGAACTAG ACTGGTTTCG ACCACAAGGT
AGCAAATTGC CGTTAATTGG GGTTGGCGGC ACGGTGCGCA ACCTTGCCAA ACTCGAACAA
CGTGCCCAGC GCTATCCACT CGATATTGTG CATGGCTATA CGATGTCGTT GCAACGGGTC
GATGAATGGG CTACCCGTTT GAGCAAACTC AATCGTAATG AGCGCGAGCA GCTTGATGGC
CTCAACAATG ATCGCGCCGA TGTTATTACC GCTGGTGTAC TGTTGATTCG AGCGTTGATG
CAACGTTGTG GTGCTGATAG TTTGTGGATT TGTGGTCATG GTCTGCGTGA TGGTATCTTC
TACGAGCAAT TTCTGCGTGG CTCACAACCG CCCTTGCTCG GCGATGTCCG CCAGTTTTCG
GTCGAGAATT TGGCACGAAT CTATGGCTAC AATGTGGTAC ATGTTGCCAA AGTGCGCGAA
TTAAGCCTCG CCTTGTTTGA TCAACTGCAA AGTTTGCATG GCTATGGAGC GTGGGAACGC
GAATTACTCG AAGCTGCGAC GGTAGTCCAT GACATTGGGG TAGCAGTCAA TTTCTACGAT
CATCATAAAC ATGGCTTATA TTTAATTCTC AACTCAATGC TGAATGGCTA TACCCACCGC
GAAATGGCCA TGGTAGCCTT GCTCACTCGC CATCATCGTA AGGGCGGCGT GACTGATGCA
GGCTTGGGTG GGGTTTTGGC TGAGGGTGAT CTTGAGCGGG TGGGCAAATT AAGTGCCTTG
CTCAGAATAG CTGAATATCT AGAGCGCTCC AAGAGCCAAG TTGTGCAAAG TATCGTGTGC
AAAATTGAGA AAAATCAGGT GCGGGTGAAG GTGCAGGCGG TTGGCGACGC TTCGATTGAA
ATTTGGGATG CGAATCGCAA AACCAATCTA TTTCGCAAAG TCTATGGAGT TGAAATGTTG
ATTGAATAG
 
Protein sequence
MTQHVGIIDL GSNTARMIVV QYQPYYSFKL VEEVKENVRL AHNVGADNQL QAEPMAMAIE 
TLRMFSNFCR ALGVNEVVAV ATSAVRDAAN QASFLAQVKE ETGLDLRVLS GDEEAYYSYL
GVINTLGVSN GFMFDIGGGS VELALVRGRG LAHTTSLPLG TVRLTEQILR SETPSKAELK
ALDRHLDEAL AELDWFRPQG SKLPLIGVGG TVRNLAKLEQ RAQRYPLDIV HGYTMSLQRV
DEWATRLSKL NRNEREQLDG LNNDRADVIT AGVLLIRALM QRCGADSLWI CGHGLRDGIF
YEQFLRGSQP PLLGDVRQFS VENLARIYGY NVVHVAKVRE LSLALFDQLQ SLHGYGAWER
ELLEAATVVH DIGVAVNFYD HHKHGLYLIL NSMLNGYTHR EMAMVALLTR HHRKGGVTDA
GLGGVLAEGD LERVGKLSAL LRIAEYLERS KSQVVQSIVC KIEKNQVRVK VQAVGDASIE
IWDANRKTNL FRKVYGVEML IE