Gene Information |
Locus tag | Hhal_0233 |
Symbol | |
ID | 4709327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 267589 |
End bp | 268788 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639854693 |
Product | glycine betaine/L-proline ABC transporter, ATPase subunit |
Protein accession | YP_001001829 |
Protein GI | 121997042 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
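The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(267589, 268788)
print(length_bp, protein_length(length_bp))  # prints: 1200 399
```

Both values agree with the Gene Length and Protein Length fields in the record.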
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.861069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAA AACTCGTCGT AGAAGACCTC TACAAGATTT TCGGCCCGAA ACCCGAACGG GCCATGGAGC TGCTGAAACA GGGCTATGAC AAGGACGCGA TCTTCCAGCG CACCGGCAAT ACCGTCGGTG TGCGGGAAGC GAGTTTCACC ATCCACGAGG GCGAAGTCTT CGTCATCATG GGCCTGTCGG GTTCCGGCAA ATCGACCATG GTGCGGATGT TCAACCGCCT CATCGACCCG ACCTACGGCC ACATCTACCT CGACGGCCAG GACATCATGC AGATGAGCCA GCAGGAGCTC ATCGACATGC GCCGCCGCGA CATGTCCATG GTGTTCCAGT CCTTCGCCCT GCTCCCGCAC AAGACCGTTG CCGAGAACGC CGCCTTCGGC CTCGACGTCT CCGGCCACGG CGCGACGGAG CGGCGCGAGA AGGCGCTCAA GGCCCTGGAG GCTGTGGGCC TGGCGGCCAA CGCCGACAGC TTCCCGGACG AGCTCTCCGG CGGCATGAAA CAGCGCGTGG GGCTGGCCCG CGCCCTGGCG ACGGATCCAA CCATTCTGCT CATGGACGAG GCCTTCTCGG CCCTCGACCC ACTGATCCGT ACCGAGATGC AGGATGAGCT CATCCGCCTG CAGCAGGAGC AGAGCCGCAC CGTCATCTTC ATCTCCCACG ACCTGGACGA GGCCATGCGC ATCGGCGATC GCATCGCCAT CATGGAGGGC GGCCAGGTGG TGCAGATCGG CACGCCGGAG GAGATCGTCT CCAACCCGGC CAACGAGTAC GTGCGCTCCT TCTTCTACGG CGTCGACGTC AGCCAGGTCT ACAACGCCGG CGACATCGCC GAGCGCCGCC GGGTCACCGT CATCCAGCGC CCCGGCGTGG ACGTGCGCAC CGCACTCGAG CGGCTCAAGC ACTACGCCAG CGACATCGGC GTGGTCATCG ACCGCAGTCG CCGCTTCCAG GGCCTGGTCT CCATCGACAG CCTGACCGAC GCCATCCGCA GCGGCGGCAG CACCGAATAC GGCGACGCCT TCCTCAGTGG CGTGCAGACC ATCCCGGCCG ACACGGCAAT CTCCGACGTG CTCGGGCCGT CGGCCGAGAG CGAGTACCCG CTGGTCGTGG TGGATGACGA CGGGCACTAC GTCGGCACCC TCTCGCGCAG CCAGCTGCTG CTCACCCTGG ATCGCACCCA ACAGGCGTAA
|
Protein sequence | MTEKLVVEDL YKIFGPKPER AMELLKQGYD KDAIFQRTGN TVGVREASFT IHEGEVFVIM GLSGSGKSTM VRMFNRLIDP TYGHIYLDGQ DIMQMSQQEL IDMRRRDMSM VFQSFALLPH KTVAENAAFG LDVSGHGATE RREKALKALE AVGLAANADS FPDELSGGMK QRVGLARALA TDPTILLMDE AFSALDPLIR TEMQDELIRL QQEQSRTVIF ISHDLDEAMR IGDRIAIMEG GQVVQIGTPE EIVSNPANEY VRSFFYGVDV SQVYNAGDIA ERRRVTVIQR PGVDVRTALE RLKHYASDIG VVIDRSRRFQ GLVSIDSLTD AIRSGGSTEY GDAFLSGVQT IPADTAISDV LGPSAESEYP LVVVDDDGHY VGTLSRSQLL LTLDRTQQA
|
| |
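The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGACCGAAA AACTCGTCGT AGAAGACCTC"))  # prints: MTEKLVVEDL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields.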