Gene Hhal_2364 details


Gene Information

Locus tag: Hhal_2364
Symbol:
ID: 4709093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2594084
End bp: 2596141
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 66%
IMG OID: 639856839
Product: choline/carnitine/betaine transporter
Protein accession: YP_001003929
Protein GI: 121999142
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
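
The coordinates and lengths in the Gene Information fields can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2594084
end_bp = 2596141
listed_gene_length = 2058      # bp
listed_protein_length = 685    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2058 685
```

Under these assumptions the listed gene length and protein length agree with the coordinates.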


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGATC GCAAGCGCGC CTTCCGCACC ACCATCCTCG CCCCGGTCTT CTTCCCCGCC 
ATCGCTGTTG CGCTGCTGCT GATCATCGGG GCGATCAGCA GCCCGGACCT GGCCGGGGCG
TTCTTCGAGG ACTTGCTCGC CTTCATCACC GAGACCTTCG GCTGGTTCTA CATGCTGGCG
GTGGCGGCGT TCCTGGTCTT TCTGGTGGCG GTTGCCTTCA CCCGGTGGGG CCATATCAAG
CTCGGGCCGG AGCACGGCGA GCCGCAGTAC AGCTTCCCCG CGTGGTTCGC CATGCTCTTC
TCGGCGGGCT ACGGGATCGT GCTGCTCTTC TTCGGCGTGG CCGAGCCGGT GCTCCACTAC
GCCGATCCGC CCCGGGGTGA GCCGGAGACG ATTGAGGCGG CGCGCCAGGC GATGCAGATC
GCCTTCTTCC ACTGGGGCTT CCACATCTGG GCCATCTACG GCCTGGTGGG GTTGGTGCTG
GCGTACTTCT CGTTCCGCCA CGGCCTGCCG CTGTCGATCC GTTCGGCGCT CTACCCGCTG
ATCGGCGATC GCATCTACGG GCCCATCGGC CATACGGTGG ATGTCTTCGC CATCCTCGGC
ACGCTGTTCG GCATCGCCAC CACGCTGGGG CTGTCGGTGG CGCAGATCAA CGCCGGGCTC
AACTACCTGT GGCCGTCGAT CCCCACCAGC ACCACGGTGC AGGTCATCGT CATCGCGGTG
ATTACGGCGC TGGCGACCAT CTCCGTGGTC GCCGGCCTGG ACAAGGGCAT CAAGCGGCTG
TCGATCCTCA ACATGATCCT CGCCGCAGCG CTGATGCTCT TCGTCTTCCT GGTCGGCCCG
TCGATCCTGA TCGTGGAGAC CTTCCTGCAG AACACCGGCA GCTACGTCAG CGGCATCGTC
GAGCGCACCT TCAATCTCGA GGCCTACGAG CGGCGGGAGT GGATCGGCAA CTGGACGCTG
TTCATCTTCG GCTGGACCAT CGCCTGGGCG CCGTTCGTGG GCATGTTCAT CGCCAAGATC
AGCCGCGGGC GGACCATCCG CCAGTTCGTC GTCGGCGTGA TGCTGGTGCC GACGCTCTTC
ACCTTCCTGT GGTTCTCGAT CTTCGGCGGC ACCGGCCTGA ACCTGATCAT GAACGAGGGC
TACGAGCAGC TCATCGGCCT TGTGCAGGAG GACGAGGCGG TGGCCCTGTT CCAGCTCTAC
GACATCCTGC CGTGGAGTGC CCTGGCGTCG TTCGTCACCG TGATCCTGAT CATGACCTTC
TTCGTCACCT CGTCGGACTC CGGCTCGCTG GTCATCGATC AGCTCGCCTC CGGCGGCGCG
TCGGTGACGC CGGTCTGGCA GCGGGTCTTC TGGGCGGTGC TCGAGGGCGC GGTGGCGGCG
GTGCTGCTCA TCGCCGGCGG ATTGGCGGCG CTGCAGACCA TGGCGGTGAC CAGTGCCCTG
CCGTTCGCGG TGATCATGCT CATCGCTGCC GGTGGGCTAT GGCGGGCCCT GATCATCGAG
AGCCACCACG ACACCAGCCT GCAGAATCAC GTCCAGCGCC GCCAGCGCTA CGGTACGCTG
CTGTGGAAGA AGCGGCTCTA CGAGCTGTTC GACTTCCCCA CCCGCGACGA CGTGATGGCA
TTCATCCGCG GCCCGGTGGT GCAGGCCCTG GAGCACGTTC AGAAGGCCCT CGACCAGCGC
GGCTGGCCGG CCAAAGTGGT GCTCGATGAG GATCACGGAC GGGTCTACCT AGCGGTACAC
CGCGACGGGT TGATGGACTT CCTCTACGAC GTGCGCCTGA CCGAGCGCCC GCGTCCGGCC
TTCGCCTACC CGTCCATCGA TCCCTCGGGC GGACCGGCTG AGGTCTACTA CCGCCCGGAG
GTCTACCTGC GCCGGGGCGG GCAGTCGTAC AGCGTCTACG AGTACAACGA GCAGGAGATC
ATCGACGACG TCCTCGATCA CTTCGAGAGC TATCTGCAGT TCCTCGACTC GGCGCCGGCC
ACGCTGCCGT GGGCGACGGA GGCGCACGAC GAGATGATCG ACGCCCCGGT GGGTGGCAAG
GGCCGCGGAC GGGGGTGA
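
The GC content field can be recomputed from the gene sequence itself. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all bases; the function name `gc_percent` is hypothetical. It strips the spaces and line breaks used in the sequence layout before counting, and is shown here only on the first ten-base block of the sequence.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base block layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence: 4 of 10 bases are G or C.
first_block = "ATGATCGATC"
print(gc_percent(first_block))  # 40.0
```

Passing the complete gene sequence to the function gives a value that can be compared with the listed GC content.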
 
Protein sequence
MIDRKRAFRT TILAPVFFPA IAVALLLIIG AISSPDLAGA FFEDLLAFIT ETFGWFYMLA 
VAAFLVFLVA VAFTRWGHIK LGPEHGEPQY SFPAWFAMLF SAGYGIVLLF FGVAEPVLHY
ADPPRGEPET IEAARQAMQI AFFHWGFHIW AIYGLVGLVL AYFSFRHGLP LSIRSALYPL
IGDRIYGPIG HTVDVFAILG TLFGIATTLG LSVAQINAGL NYLWPSIPTS TTVQVIVIAV
ITALATISVV AGLDKGIKRL SILNMILAAA LMLFVFLVGP SILIVETFLQ NTGSYVSGIV
ERTFNLEAYE RREWIGNWTL FIFGWTIAWA PFVGMFIAKI SRGRTIRQFV VGVMLVPTLF
TFLWFSIFGG TGLNLIMNEG YEQLIGLVQE DEAVALFQLY DILPWSALAS FVTVILIMTF
FVTSSDSGSL VIDQLASGGA SVTPVWQRVF WAVLEGAVAA VLLIAGGLAA LQTMAVTSAL
PFAVIMLIAA GGLWRALIIE SHHDTSLQNH VQRRQRYGTL LWKKRLYELF DFPTRDDVMA
FIRGPVVQAL EHVQKALDQR GWPAKVVLDE DHGRVYLAVH RDGLMDFLYD VRLTERPRPA
FAYPSIDPSG GPAEVYYRPE VYLRRGGQSY SVYEYNEQEI IDDVLDHFES YLQFLDSAPA
TLPWATEAHD EMIDAPVGGK GRGRG
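
The protein sequence can be checked against the gene sequence by translating the codons. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in which codons may initiate translation, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 bases and the final 18 bases of the gene are translated here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first = translate("ATGATCGATC GCAAGCGCGC CTTCCGCACC")
# Final 18 bases; they start at position 2041, so they are in frame.
last = translate("GGCCGCGGAC GGGGGTGA")

print(first)  # MIDRKRAFRT
print(last)   # GRGRG
```

Both fragments match the corresponding ends of the protein sequence, and the final codon TGA is a stop codon.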