Gene Hhal_0233 details

Gene Information

Locus tag: Hhal_0233
Symbol:
ID: 4709327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 267589
End bp: 268788
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 66%
IMG OID: 639854693
Product: glycine betaine/L-proline ABC transporter, ATPase subunit
Protein accession: YP_001001829
Protein GI: 121997042
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
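
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and are not part of the database record.

```python
# Consistency check for the coordinate fields of this record.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(267589, 268788)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1200 399
```

Both values agree with the Gene Length and Protein Length fields listed above.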


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.861069
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAA AACTCGTCGT AGAAGACCTC TACAAGATTT TCGGCCCGAA ACCCGAACGG 
GCCATGGAGC TGCTGAAACA GGGCTATGAC AAGGACGCGA TCTTCCAGCG CACCGGCAAT
ACCGTCGGTG TGCGGGAAGC GAGTTTCACC ATCCACGAGG GCGAAGTCTT CGTCATCATG
GGCCTGTCGG GTTCCGGCAA ATCGACCATG GTGCGGATGT TCAACCGCCT CATCGACCCG
ACCTACGGCC ACATCTACCT CGACGGCCAG GACATCATGC AGATGAGCCA GCAGGAGCTC
ATCGACATGC GCCGCCGCGA CATGTCCATG GTGTTCCAGT CCTTCGCCCT GCTCCCGCAC
AAGACCGTTG CCGAGAACGC CGCCTTCGGC CTCGACGTCT CCGGCCACGG CGCGACGGAG
CGGCGCGAGA AGGCGCTCAA GGCCCTGGAG GCTGTGGGCC TGGCGGCCAA CGCCGACAGC
TTCCCGGACG AGCTCTCCGG CGGCATGAAA CAGCGCGTGG GGCTGGCCCG CGCCCTGGCG
ACGGATCCAA CCATTCTGCT CATGGACGAG GCCTTCTCGG CCCTCGACCC ACTGATCCGT
ACCGAGATGC AGGATGAGCT CATCCGCCTG CAGCAGGAGC AGAGCCGCAC CGTCATCTTC
ATCTCCCACG ACCTGGACGA GGCCATGCGC ATCGGCGATC GCATCGCCAT CATGGAGGGC
GGCCAGGTGG TGCAGATCGG CACGCCGGAG GAGATCGTCT CCAACCCGGC CAACGAGTAC
GTGCGCTCCT TCTTCTACGG CGTCGACGTC AGCCAGGTCT ACAACGCCGG CGACATCGCC
GAGCGCCGCC GGGTCACCGT CATCCAGCGC CCCGGCGTGG ACGTGCGCAC CGCACTCGAG
CGGCTCAAGC ACTACGCCAG CGACATCGGC GTGGTCATCG ACCGCAGTCG CCGCTTCCAG
GGCCTGGTCT CCATCGACAG CCTGACCGAC GCCATCCGCA GCGGCGGCAG CACCGAATAC
GGCGACGCCT TCCTCAGTGG CGTGCAGACC ATCCCGGCCG ACACGGCAAT CTCCGACGTG
CTCGGGCCGT CGGCCGAGAG CGAGTACCCG CTGGTCGTGG TGGATGACGA CGGGCACTAC
GTCGGCACCC TCTCGCGCAG CCAGCTGCTG CTCACCCTGG ATCGCACCCA ACAGGCGTAA
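
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above (six blocks of 10 bases).
first_row = "ATGACCGAAA AACTCGTCGT AGAAGACCTC TACAAGATTT TCGGCCCGAA ACCCGAACGG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full 1200 bp sequence to `gc_percent` gives a value that can be compared against the GC content field of this record.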
 
Protein sequence
MTEKLVVEDL YKIFGPKPER AMELLKQGYD KDAIFQRTGN TVGVREASFT IHEGEVFVIM 
GLSGSGKSTM VRMFNRLIDP TYGHIYLDGQ DIMQMSQQEL IDMRRRDMSM VFQSFALLPH
KTVAENAAFG LDVSGHGATE RREKALKALE AVGLAANADS FPDELSGGMK QRVGLARALA
TDPTILLMDE AFSALDPLIR TEMQDELIRL QQEQSRTVIF ISHDLDEAMR IGDRIAIMEG
GQVVQIGTPE EIVSNPANEY VRSFFYGVDV SQVYNAGDIA ERRRVTVIQR PGVDVRTALE
RLKHYASDIG VVIDRSRRFQ GLVSIDSLTD AIRSGGSTEY GDAFLSGVQT IPADTAISDV
LGPSAESEYP LVVVDDDGHY VGTLSRSQLL LTLDRTQQA
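
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a minimal sketch, the code below builds a codon table from the standard codon assignments, which table 11 shares (the two tables differ in the start codons they permit), and translates up to the first stop codon. The function name `translate` is hypothetical. The sketch does not handle alternative start codons or ambiguous bases, neither of which occurs in this gene, which starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (third base varies fastest).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a (possibly blocked) DNA sequence up to the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCGAAA AACTCGTCGT AGAAGACCTC"))  # MTEKLVVEDL
```

The output matches the first ten residues of the protein sequence listed above.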