Gene Hhal_1019 details

Gene Information

Locus tag: Hhal_1019
Symbol:
ID: 4709597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1090713
End bp: 1091738
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 72%
IMG OID: 639855490
Product: aminoglycoside phosphotransferase
Protein accession: YP_001002597
Protein GI: 121997810
COG category: [R] General function prediction only
COG ID: [COG3178] Predicted phosphotransferase related to Ser/Thr protein kinases
TIGRFAM ID:
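
The start, end, and length fields above are mutually consistent. A minimal Python sketch of the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1090713
end_bp = 1091738

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1026 341
```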


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGAC TGGCCTGGGA TGATGCCCGC GCCCGGCGGG CGCGTGAGTG GGCGCGGCGG 
CAGCTCGGCG CGGCGCAGGA GCCGGCACTG GAGCCGGCGG CGGCCGACGC CAGCAGCCGC
CGCTACTTCC GGATCCAGGT GCCCGGCGAC GGCAGCCGGA TCATCATGGA CGCCCCGGAG
CAGTCCGAGG CGGTGGCGGC CTTCGTCCGG ATCGCCGGAC TGCTGGCCGA GGCGGGGATC
CACGCCCCGC AGATCGAGGC CGCCGATCTG GAACACGGCC TGCTGCTGCT GACCGACCTG
GGCCACCGCA CCTACCTGCA GGCACTGCAC GAGGGGGACG ACCCGCAACC GCTGCTCGAC
GCCGCCGTCG ACGCCCTGAT CCGCGCCCAG GCCACGGTGC CGGTGGACGG GCTGCCGGCC
TACGATGAGG CCCGGCTGCG CGGGGAGTTG GAGCTGTTCA CCGACTGGTA TCTGCCGTGC
TGCTGCGGGG TGGCTTCCGC GGAGGCCCGC CGGCGGCTCA GGGGTGGGTT GGACGATCTG
CTCGAGCGGG TTGGCGGACA GGCCCGGGCC TTCTGCCATC GCGATTACAT GCCCCGCAAT
CTGATGGTAT GCACCCCGCT GCCGGGGGTG CTGGACTTCC AGGACGCAGT GGCTGGGCCG
GTGACCTACG ATGCGGTCTC GCTGCTGCGC GACGCCTTCA TCAGTTGGCC GCGGGAGCGC
GAGGTCGCCA CCCTGCAGCG CTACCACCGC CGGGCACGGG CCGCTGGCGT GCCGGTGCCC
GAAGGCTTCG AGGCGTTCTG GGCCGATTGC CAGTGGATGG GCGTGCAGCG CCACCTCAAG
GTGCTCGGCA TCTTCGCCCG GTTGGCCTAC CGCGACGGTA AGCCGCGCTA TTTGACCGAT
GCGCCGCGGT TCTTCGCCTA TCTGCGTGCC GCAGCCGAGG AGCAGGCGGC GCTGCGACCG
CTGGTCGATG AGGTGCTGGC CCTGGCGGCG GTGCCGGGCC GCGTGTCGTA CGCCGAGGTG
GACTGA
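
The GC content listed in the Gene Information section can be recomputed from the gene sequence above. A minimal Python sketch, assuming the sequence is pasted as text in the 10-base groups shown here; the helper names `clean_sequence` and `gc_fraction` are illustrative and not part of the source database:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a (possibly grouped) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input in the same grouped layout; paste the full gene sequence instead
# to compare against the reported GC content.
print(gc_fraction("ATGC ATGC\nGGCC AATT"))  # 0.5
```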
 
Protein sequence
MNGLAWDDAR ARRAREWARR QLGAAQEPAL EPAAADASSR RYFRIQVPGD GSRIIMDAPE 
QSEAVAAFVR IAGLLAEAGI HAPQIEAADL EHGLLLLTDL GHRTYLQALH EGDDPQPLLD
AAVDALIRAQ ATVPVDGLPA YDEARLRGEL ELFTDWYLPC CCGVASAEAR RRLRGGLDDL
LERVGGQARA FCHRDYMPRN LMVCTPLPGV LDFQDAVAGP VTYDAVSLLR DAFISWPRER
EVATLQRYHR RARAAGVPVP EGFEAFWADC QWMGVQRHLK VLGIFARLAY RDGKPRYLTD
APRFFAYLRA AAEEQAALRP LVDEVLALAA VPGRVSYAEV D