Gene Hlac_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3551 
Symbol 
ID7402394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp298078 
End bp299238 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content64% 
IMG OID643710089 
ProductABC-type phosphate transport system, substrate binding protein 
Protein accessionYP_002567655 
Protein GI222481419 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACGCG ACTCAGGGCG TCAGGCGAGC GCGGTATCAC GTCGAAAGTA TCTGCTGACG 
TCGGCCGCCA TCGGGACGGC GGGCCTTGCC GGGTGTAGCG ACAGCGATAG CGGAGGCACC
GGAAGCGACA GCGGAGGCGA CGAGGGCAGT AACTCTATCG AATCTGGCTC CTCGGGGCCT
TCGACGGTGA CCGCCGAGGG ATCCTCGACG GTGTACCCTA TCTCGAACAG GGGGAGTTCC
TACTGGAACT CGAACTCGCC GGCGAGCGAC GGCGAGTACT GGGGCTCGAA CGACGAGTCG
TCGGTCGCCG GCTGGGACCA GATCGAGACC GACCAGAACA TCGCCGACTA CTTCGCGAGC
CTCTACGGCT TCGAAACGAC CGGTGAACGG TCGAACCCGC CGTTCGCCAC TCGCGTCGCA
CTGAGCCACT CAGGGACCGG CTGCGAGGCG GTCCGAGACG GCCTCGTCGA CATCGGGAAC
TCCTCGGGAC CGATAACCGC AGAACTCGAC ATCAGCGAAG AGCAGCGCGA CGAGAACTAC
GTCGACCACG TCGTCGGCCG CGACGGTCAG CCGGTGTTCG TCAGTCAGGC AATCTACGAC
GCCGGCGTCG AACAGCTCAC TGGTGAGCAG ATCCGCGGAA TCTACCAAGG CGACATCACC
AACTGGAGCG AGGTCGGCGG CCCGGATCGG GAAATCTTCG TTGTCGGCCG CGCAGAGGGA
TCGGGTACGG ACACCTCGTT CCGACTGAAC ATGCTCGGCG ACGCCGACGC GCCGATGGAT
GTCGACAGTC GCTTCGGACA GAACCAGCAG GTCCAGCAGG TCCTCCAAGA CAACGACAAC
GCCATCGGCT ACATGGCGCT AGCGTTCTCC GGGTCGGGGA TTCAGGCGAT CGGCATCGAG
TTTGAGGGGA CGTTGTTCGA GCCGGATGCC GACGCCGAGA ACACCATCTT CGACTCGGAG
TACCCGCTGA ACCGGGACCT CCACATGTAC ACCCGGATCA ACGAGGATAC GCCCGAGGGG
ACAGACATGC GTGAGGCCGC CTTCCTCAAC ATGTTCCTGA CCGAGTTCGG CCAGCAGACG
TTCGTCGAGG ACGTCAACTA CATCACGCTG CCCACCTCCG ACATCGAGGC CGAACGCGAG
AAGCTCCCCG ACCAGGCGTA A
 
Protein sequence
MSRDSGRQAS AVSRRKYLLT SAAIGTAGLA GCSDSDSGGT GSDSGGDEGS NSIESGSSGP 
STVTAEGSST VYPISNRGSS YWNSNSPASD GEYWGSNDES SVAGWDQIET DQNIADYFAS
LYGFETTGER SNPPFATRVA LSHSGTGCEA VRDGLVDIGN SSGPITAELD ISEEQRDENY
VDHVVGRDGQ PVFVSQAIYD AGVEQLTGEQ IRGIYQGDIT NWSEVGGPDR EIFVVGRAEG
SGTDTSFRLN MLGDADAPMD VDSRFGQNQQ VQQVLQDNDN AIGYMALAFS GSGIQAIGIE
FEGTLFEPDA DAENTIFDSE YPLNRDLHMY TRINEDTPEG TDMREAAFLN MFLTEFGQQT
FVEDVNYITL PTSDIEAERE KLPDQA