Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_3551 |
Symbol | |
ID | 7402394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | + |
Start bp | 298078 |
End bp | 299238 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643710089 |
Product | ABC-type phosphate transport system, substrate binding protein |
Protein accession | YP_002567655 |
Protein GI | 222481419 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACGCG ACTCAGGGCG TCAGGCGAGC GCGGTATCAC GTCGAAAGTA TCTGCTGACG TCGGCCGCCA TCGGGACGGC GGGCCTTGCC GGGTGTAGCG ACAGCGATAG CGGAGGCACC GGAAGCGACA GCGGAGGCGA CGAGGGCAGT AACTCTATCG AATCTGGCTC CTCGGGGCCT TCGACGGTGA CCGCCGAGGG ATCCTCGACG GTGTACCCTA TCTCGAACAG GGGGAGTTCC TACTGGAACT CGAACTCGCC GGCGAGCGAC GGCGAGTACT GGGGCTCGAA CGACGAGTCG TCGGTCGCCG GCTGGGACCA GATCGAGACC GACCAGAACA TCGCCGACTA CTTCGCGAGC CTCTACGGCT TCGAAACGAC CGGTGAACGG TCGAACCCGC CGTTCGCCAC TCGCGTCGCA CTGAGCCACT CAGGGACCGG CTGCGAGGCG GTCCGAGACG GCCTCGTCGA CATCGGGAAC TCCTCGGGAC CGATAACCGC AGAACTCGAC ATCAGCGAAG AGCAGCGCGA CGAGAACTAC GTCGACCACG TCGTCGGCCG CGACGGTCAG CCGGTGTTCG TCAGTCAGGC AATCTACGAC GCCGGCGTCG AACAGCTCAC TGGTGAGCAG ATCCGCGGAA TCTACCAAGG CGACATCACC AACTGGAGCG AGGTCGGCGG CCCGGATCGG GAAATCTTCG TTGTCGGCCG CGCAGAGGGA TCGGGTACGG ACACCTCGTT CCGACTGAAC ATGCTCGGCG ACGCCGACGC GCCGATGGAT GTCGACAGTC GCTTCGGACA GAACCAGCAG GTCCAGCAGG TCCTCCAAGA CAACGACAAC GCCATCGGCT ACATGGCGCT AGCGTTCTCC GGGTCGGGGA TTCAGGCGAT CGGCATCGAG TTTGAGGGGA CGTTGTTCGA GCCGGATGCC GACGCCGAGA ACACCATCTT CGACTCGGAG TACCCGCTGA ACCGGGACCT CCACATGTAC ACCCGGATCA ACGAGGATAC GCCCGAGGGG ACAGACATGC GTGAGGCCGC CTTCCTCAAC ATGTTCCTGA CCGAGTTCGG CCAGCAGACG TTCGTCGAGG ACGTCAACTA CATCACGCTG CCCACCTCCG ACATCGAGGC CGAACGCGAG AAGCTCCCCG ACCAGGCGTA A
|
Protein sequence | MSRDSGRQAS AVSRRKYLLT SAAIGTAGLA GCSDSDSGGT GSDSGGDEGS NSIESGSSGP STVTAEGSST VYPISNRGSS YWNSNSPASD GEYWGSNDES SVAGWDQIET DQNIADYFAS LYGFETTGER SNPPFATRVA LSHSGTGCEA VRDGLVDIGN SSGPITAELD ISEEQRDENY VDHVVGRDGQ PVFVSQAIYD AGVEQLTGEQ IRGIYQGDIT NWSEVGGPDR EIFVVGRAEG SGTDTSFRLN MLGDADAPMD VDSRFGQNQQ VQQVLQDNDN AIGYMALAFS GSGIQAIGIE FEGTLFEPDA DAENTIFDSE YPLNRDLHMY TRINEDTPEG TDMREAAFLN MFLTEFGQQT FVEDVNYITL PTSDIEAERE KLPDQA
|
| |