Gene Hhal_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1117 
Symbol 
ID4710067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1212585 
End bp1213748 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content70% 
IMG OID639855589 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_001002695 
Protein GI121997908 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.704778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAGAC CCAGCTACGA GGGCAGCGGC CTGGTCAACC TGATGGCGTC CCTGGGCCGT 
GCCTTCGGGG CGGCGTCCAG CCACTATCCG GCCCTGGATC CTGAGCCGGA GCTGGGGCTG
GAGGAGGCGC GCACGGTCAT CCTCTGGATC ATGGATGGCC TCGGCGATCA CTACCTGGCC
AGGCAGCCGG GCAGCAGCCT GGCGCGGGAT CGGGTGCGGG TGCTGACCTC GGTCTTCCCC
GCCACCACCT CGGCGGCCTT GACCAGCATC ATTACCGGGC GTCCGCCGCG GGGGCACGGG
GTCACCGGCT GGTTTATGTA CGTCCACGAG CTGGGGGCGG TCACCGCCTG GCTCCCCTTC
GGTCCGCGGG TGGGCAAGGG GCAGTGGTCC AGCATCGAGC CGGAGAGCGC CGAGCTGCTG
CAGCGCGACC CGATCTGGGA TCGGTTTCAG GCCGAGACGC ACGTCGTTCA ACCCTCCTGG
CTGGTCGACA CGCCGTATAG CCGGGCGGTC ACCGGGCGCT ATGCCCGCCG GCACGGCTAT
CAGGGGTTGG ACGAGCTGCG CGAGGTGCTG GTGCGCATCG CCCGCGAGCC CGGTCGGCAG
CGACGGTTCG TCTACGCCTA CTGGCCGGAC CTGGATACGC TGAGCCACCA GCACGGTGTC
GACAGTGCAG CGGTGCGCGA CCAGTTCCGC TCCATCGACA TCGCCTGGCA GCGGCTGCTC
GATGGCCTTC AGGGCACCGA CACCGTGATC CTCGGCACCG CCGACCACGG CCTGATCGAT
ACTGCCCCCG AGCGGACCCT CTATCTGGGG GACCATCCGG AGTTGGCCGA GATGCTGGCC
CTGCCGCTGT GCGGCGAGCC CCGGGCGGCC TACTGCTACC TGCGTCCGGG CACCGAACTC
GACTTCCAGT CCTATTGCCG CGAGCGCCTG GGTACGGTCT GCCAGGTCGC CCGCTCCGAG
GAGCTCCTGG CGGCGGGTTG GTTCGGGCCC ATGCCGGAAC ACCCGAAGCT GCGCCGGCGG
ATCGGTGATT GGGTGCTGCT GCCGGCCGAT GGCTGGGTGA TCAAGGACCG GCTGGTGGGC
GAGGGGCGCT TCGCCCAGGT GGGGGTGCAC GGCGGGGCGT CGGCGAGCGA GCAGTGGGTG
CCGCTGATTG CCGCACGGCC GTGA
 
Protein sequence
MDRPSYEGSG LVNLMASLGR AFGAASSHYP ALDPEPELGL EEARTVILWI MDGLGDHYLA 
RQPGSSLARD RVRVLTSVFP ATTSAALTSI ITGRPPRGHG VTGWFMYVHE LGAVTAWLPF
GPRVGKGQWS SIEPESAELL QRDPIWDRFQ AETHVVQPSW LVDTPYSRAV TGRYARRHGY
QGLDELREVL VRIAREPGRQ RRFVYAYWPD LDTLSHQHGV DSAAVRDQFR SIDIAWQRLL
DGLQGTDTVI LGTADHGLID TAPERTLYLG DHPELAEMLA LPLCGEPRAA YCYLRPGTEL
DFQSYCRERL GTVCQVARSE ELLAAGWFGP MPEHPKLRRR IGDWVLLPAD GWVIKDRLVG
EGRFAQVGVH GGASASEQWV PLIAARP