Gene Hhal_0215 details

Gene Information

Locus tag: Hhal_0215
Symbol:
ID: 4710080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 248668
End bp: 250197
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 73%
IMG OID: 639854674
Product: Ppx/GppA phosphatase
Protein accession: YP_001001811
Protein GI: 121997024
COG category: [F] Nucleotide transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
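
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates and the terminal stop codon is not counted in the protein length. Both of those are assumptions, not statements from the record. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the record above.
start_bp = 248668
end_bp = 250197

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```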


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGACG AGCAGGTGAT CGCAGCGGTC GATCTGGGCT CGAACAGTTT CCATATGGTG 
GTGGCGCGCA TCGATCCGGC CACCCGCACG CTGCGGGTGG TGGATCGGTT GCGCGAGACG
GTCCGGCTCG GCGCCGGCCT GGGCGAGGGG CAGAAACGGC TGAGCGAGGA CGCCCGGGAG
CGGGCGCTGG CTTGCCTGGC CCGCTTCGGG GATCGGTTGC GCCGCCTGCA GGCGCAGCGG
GTTCGTGCCG TGGGGACGAA CACCCTGCGC AGGGCGCGGG ATGCCACGGA CTTCATGCAG
GAGGCCGAGG GCGCCCTGGG GCACCCGATC GAGGTGGTTT CGGGCTATGA GGAGGCTCGG
CTGATCTACC TCGGTGTGGC CCACAATCTG GGGTTCGACG AGCGCCGGCG GGTGGTGATC
GATATCGGTG GGGGTAGTAC CGAGCTGATC CTCGGCCGCG GCCCGCGGGC CGAGCAGATG
GAGAGCGTCC ATCTGGGGTG CGTCTCGCTC ACCGGACGCT GCTTCGCCGA CGGCCGCATC
ACCGGGCGGC AGTTCCAGAA GGCCCTGGTC CTCGCCCGGC TGGAGCTGGA GCCGGTGGAG
GGCGCCTTTC GCTCGCCGGC GTGGCAGGGG GCTGTGGGCG CCTCCGGGAC GGTGCGCGCC
GCCGCCGACG CCTGCGCCAG CCGGGACTGG TCACCGCCCG GGTGGATCAC CGCCGAGGCG
CTGGGCCGCC TGCGCCGGCT GGCGGTGGAG GCCGGCGACG CCGAGACCCT GGGCGAGTGG
CTGGGGCTTT CCGGCGACCG GCGCCAGGTC TTCCCGGCGG GGCTGGCGGC CCTCTGTGCG
GTCTTCGAGG CGCTGGGCAT CGAGCGTATG GAGGTGGCCG ACGGCGCCCT GCGCGAGGGG
GTGATGTACG ACCTGGCGGG CCGCCTGGGG ATGCTCCAGC ACAGCGAGGA CGCCCGGGCC
AATACGGTCT CGGCGCTGCG CCGGCGCTAT TCGGTGGAGG CCGGCCAGGC CGACCGGGTG
GCGGCGACGG CGGCGGGGCT GCTCGATCAG GTGGCGCCGG GGTGGGGGCT GGCAGGACGC
TTTTACCGCG ACATCCTCGA CTGGGCGGCC CAGCTCCACG AGATCGGTCT GGATATCTCC
CACGCCCAGT ACCACAAGCA CGGTGCCTAC ATCCTGCGTA ACGCGGACAT GGCCGGCTTC
TCCCGTCAGG AGCAGCAGCT GCTGGCCCTG CTGGTGCGGG TGCACCGCCG TAAGCTGGCC
CGCGGGCAGT TGAAGGCGCT GCCCCGGCGC TGGCTGGACA CCGGCAAGCG GCTGGCCGTG
GTGCTGCGCC TGGCGGTTCT GCTCCACCGT GGGCGGGCCG ACGGTCGGGT GGTGGAGCCG
CGCCTGGAAC CGCTGACCGA CGGGCTGCGG CTGTGGTTCC CGTCCGGGTG GCTGGCGGAC
AACCCCCTGC TCCAGGCCGA TCTGCTCCAG GAGCAGCGCT ACCTGGAGCG TGCCGGGATG
ACCCTGGAAC TGGCCGAGGC CCCCGAGTAA
 
Protein sequence
MRDEQVIAAV DLGSNSFHMV VARIDPATRT LRVVDRLRET VRLGAGLGEG QKRLSEDARE 
RALACLARFG DRLRRLQAQR VRAVGTNTLR RARDATDFMQ EAEGALGHPI EVVSGYEEAR
LIYLGVAHNL GFDERRRVVI DIGGGSTELI LGRGPRAEQM ESVHLGCVSL TGRCFADGRI
TGRQFQKALV LARLELEPVE GAFRSPAWQG AVGASGTVRA AADACASRDW SPPGWITAEA
LGRLRRLAVE AGDAETLGEW LGLSGDRRQV FPAGLAALCA VFEALGIERM EVADGALREG
VMYDLAGRLG MLQHSEDARA NTVSALRRRY SVEAGQADRV AATAAGLLDQ VAPGWGLAGR
FYRDILDWAA QLHEIGLDIS HAQYHKHGAY ILRNADMAGF SRQEQQLLAL LVRVHRRKLA
RGQLKALPRR WLDTGKRLAV VLRLAVLLHR GRADGRVVEP RLEPLTDGLR LWFPSGWLAD
NPLLQADLLQ EQRYLERAGM TLELAEAPE
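
The relationship between the gene sequence and the protein sequence can be sketched in Python. This is an illustrative sketch, not the database's own pipeline. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks used in the 10-character layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGCGAGACG AGCAGGTGAT CGCAGCGGTC"))
```

Applied to the first 30 bases of the gene sequence, `translate` returns MRDEQVIAAV, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.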