Gene Hlac_0502 details


Gene Information

Locus tag: Hlac_0502
Symbol:
ID: 7400383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 521826
End bp: 522815
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 66%
IMG OID: 643707567
Product: phosphate binding protein
Protein accession: YP_002565174
Protein GI: 222478937
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0226] ABC-type phosphate transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR02136] phosphate binding protein
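
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(521826, 522815)
print(length, protein_length_aa(length))  # 990 329
```

Under these assumptions the result agrees with the listed gene length (990 bp) and protein length (329 aa).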


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.940111
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.578937
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATCAA GCCACGACGC GGATACCGAG GGGATCACGC GGCGGAAATC GCTTTCGGCG 
CTCGCCGGGG CGGGCGCGCT GGCGCTCGCA GGATGTACGC AGAGCACGGG AGGCGGCAGT
GGTGACGCCC TGTCCGGCTC GATCAACATC GCGGGTTCCT CGACCGTCTT CCCGCTGATG
AGCGCGATCG GGGAGGACTT CGCCGCGGAG CACGACCAGG TCTCGGTCGA CATCAGTTCG
ACCGGCTCGG GCGGCGGGTT CTCGAACTAC TTCTGCGTCG GCGACACCGA CTTCAACAAC
GCGAGCCGGC CGATCCAGCC CGCAGAGGAG GAACTCTGCG AGGAGAACGG CGTCGAGTAC
GTCGAGCTCA TCGCGGCGAC GGACGCGCTC ACGATCGTCG TCAACCCCGA GGCCGACTGG
ATCGACTCCC TCACCGTCGA GGAGCTCGCA CAGATCTGGG AGGAGGACCC CGCCCAGACC
TGGGACGAGG TCCGCGACGA GTTCCCGAAC GAGGAGATCG AGCGGTTCGG CGCCGCCGAC
ACCTCCGGGA CGTACGACTA CTTCATCGAG AGCATTCTGG AGGAGCGCGG TCACACTAGC
GACTACCAAG CGACCGAGCA GGACAACTCG ATCGCACAGG GCGTCTCGGG CAGCGAGTAC
GCGATCGGCT ACTTCGGCTT CGCGTACTAC TTCCAGAACC CCGATCAGCT CAAGGCGCTC
GGGATCGACG ACGGCAACGG GCCGGTCGAG CCGAGCCTCG AAACGGCGTC CAGCGGCGAG
TACACTCCCC TTTCCCGGCC GCTGTTCACC TACCCGTCGA TCGAGTCACT CGGCAAGGAG
CACGTCGCCG AGTTCGCGCG CTACTTCGTC GAGCAGACGA CCAACGAGGA CCTCGTCGCC
GGCGACGTGG GATACGTGCC CGCGACTGAA GAGACCCAAG AGGAGCAGAT GGAGATCCTC
GAGGACGCTA TCGAGCGGGC GCAGGAGTAA
 
Protein sequence
MASSHDADTE GITRRKSLSA LAGAGALALA GCTQSTGGGS GDALSGSINI AGSSTVFPLM 
SAIGEDFAAE HDQVSVDISS TGSGGGFSNY FCVGDTDFNN ASRPIQPAEE ELCEENGVEY
VELIAATDAL TIVVNPEADW IDSLTVEELA QIWEEDPAQT WDEVRDEFPN EEIERFGAAD
TSGTYDYFIE SILEERGHTS DYQATEQDNS IAQGVSGSEY AIGYFGFAYY FQNPDQLKAL
GIDDGNGPVE PSLETASSGE YTPLSRPLFT YPSIESLGKE HVAEFARYFV EQTTNEDLVA
GDVGYVPATE ETQEEQMEIL EDAIERAQE
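
The gene and protein sequences are printed in space-separated blocks of ten, so they need cleaning before reuse. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, translate the coding sequence, and compute GC content. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(dna.count(b) for b in "GC") / len(dna)


# First three blocks of the gene sequence as listed.
first_blocks = clean("ATGGCATCAA GCCACGACGC GGATACCGAG")
print(translate(first_blocks))  # MASSHDADTE
```

The translated fragment matches the first block of the protein sequence. Passing the complete gene sequence through `clean` and `translate` would allow the same comparison against the full protein listing, and `gc_percent` against the reported GC content.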