Gene Hhal_0237 details


Gene Information

Locus tag: Hhal_0237
Symbol:
ID: 4709928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 272309
End bp: 273961
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 67%
IMG OID: 639854697
Product: choline/carnitine/betaine transporter
Protein accession: YP_001001833
Protein GI: 121997046
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
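
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Both assumptions are consistent with the listed values but are not stated by the record itself.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 272309
end_bp = 273961
gene_length_bp = 1653
protein_length_aa = 550

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```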


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000527978
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCGATC CCAACAACAC CGATCCGAAA GAGGTGAAGA AGGAGATCGA GGAGCTGGAG 
CAGGCCTACG AGACCGACCA CGAGATCGGC GATCAGAACA TCAGCACCGA GATCAAGCCC
ATCGGGCTGG CGCTGGATCT GCACAACCCG GTCTTCATCG TCAGCTCCGC GCTGATCCTC
GTCTTCCTCA TCGGCACCCT CATCTTCACG GCCCCCGCCC AGGAGGCGCT GGAAGGCGTC
CGCGGCTGGG CCACCAGCAG CTTCGACTGG TTCTTCCTCA CCGCCGGCAA CATCTTCGTC
CTCTTCTGCC TGCTGCTGAT CGTCCTGCCC CTGGGCAGCA TCCGCATCGG CGGACAGGAC
GCGAAGCCGG ACTTCTCGCG ACTGTCCTGG TTCACCATGC TCTTCGCCGC CGGCATGGGC
ATCGGCCTGA TGTTCTGGGC GGTGGCCGAG CCGGTGGGCT ACTACACCGA GTGGTTCGGC
TCGCCGTTCA ACATCGAGGG CGGCACCGAC GAGGCGGCCA AGGCGGCCAT GGGTGCGACC
ATGTACCACT GGGGCCTGCA CCCGTGGGCC ATCTACGGCG TCATGGCGCT GGCCCTGGCC
TTCTTCACCT ACAACAAGGG GCTGCCGCTG ACCGTGCGCT CGGTCTTCTA CCCCCTCCTG
GGTGAGCGGG TGTGGGGGCC GCTGGGCCAC ATCATCGACA CCGTGGCGGT GCTGGCCACC
ATCTTCGGCC TGGCCACCTC CCTGGGCTTC GGCGCCCAAC AGGCGGCCAG CGGCCTGAGC
TACGTCTTCG AGGCCGTGCC CGATACTCTG GGCACCCAGG TGGCGATCAT CATCGGCGTC
ACGGTGGCGG CGCTCGTCTC GGTGCTGCGC GGCATCGACG GCGGCATCAA GCTGCTCAGC
AACCTCAACA TCAGCCTCGC CGGGCTGCTG ATGCTCTTCG TCATCATCGC CGGCGGCGCC
ATCGCCTTCG TCACCCAGCT CTGGCACACC ACCAGCGCCT ACGCCGGGGA CTTCTTCGCC
CTCTCCAACC CGGTGGGCCG CGAGGACGAG ACCTTCCTCC AGGGCTGGAC GGCCTTCTAC
TGGGCGTGGT GGATCAGCTG GTCGCCCTTC GTCGGCATGT TCATCGCCCG GGTCTCCCGC
GGCCGCACGG TGCGCGAGTT CATGACCGCG GTGCTGATCG TGCCCACGGT GGTGACCATC
TTCTGGATGA GCGCCTTCGG CGGCGTGGGC CTGCAGCAGG CCATCGAGGG CATCGGTGCC
CTGGCCGACG GCATCGGCGC CGACGAGTCC ATGGCCCTGT TCCACATGCT GGAGCAGCTG
CCCTGGACCC TGCTCACCGC CTCGGTGGCG GTCTTCCTGG TGCTGGTCTT CTTCGTGACC
TCGTCGGACT CCGGCTCGCT GGTGATCGAC AGCATCACCG CCGGCGGCAA GACCGACGCC
CCGGACGCCC AGCGCGTCTA TTGGGTGGTC ATGGAGGGCC TGATCGCCGG TGTGCTGCTG
TTCATCGGCG GGGACGCCGC CCTCAGCGCC CTGCAGGCGG GGGCGGTCTC GGCCGGGCTG
CCGTTCACCG TGGTCCTGCT CCTGGTCTGC CTGAGTCTGC TGATCGGGCT GCGCCACGAG
CGGCGGCTGA TCAAGCTGAC CCAACAGGCC TGA
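
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and only the first 60 bases of the sequence are used to keep the example short. Paste the full sequence in place of `first_block` to compare against the listed value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 60 bases of the gene sequence, spaced in blocks of ten as in the record.
first_block = (
    "GTGACCGATC CCAACAACAC CGATCCGAAA "
    "GAGGTGAAGA AGGAGATCGA GGAGCTGGAG"
)

print(f"GC content of the first 60 bases: {gc_percent(first_block):.1f}%")
```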
 
Protein sequence
MTDPNNTDPK EVKKEIEELE QAYETDHEIG DQNISTEIKP IGLALDLHNP VFIVSSALIL 
VFLIGTLIFT APAQEALEGV RGWATSSFDW FFLTAGNIFV LFCLLLIVLP LGSIRIGGQD
AKPDFSRLSW FTMLFAAGMG IGLMFWAVAE PVGYYTEWFG SPFNIEGGTD EAAKAAMGAT
MYHWGLHPWA IYGVMALALA FFTYNKGLPL TVRSVFYPLL GERVWGPLGH IIDTVAVLAT
IFGLATSLGF GAQQAASGLS YVFEAVPDTL GTQVAIIIGV TVAALVSVLR GIDGGIKLLS
NLNISLAGLL MLFVIIAGGA IAFVTQLWHT TSAYAGDFFA LSNPVGREDE TFLQGWTAFY
WAWWISWSPF VGMFIARVSR GRTVREFMTA VLIVPTVVTI FWMSAFGGVG LQQAIEGIGA
LADGIGADES MALFHMLEQL PWTLLTASVA VFLVLVFFVT SSDSGSLVID SITAGGKTDA
PDAQRVYWVV MEGLIAGVLL FIGGDAALSA LQAGAVSAGL PFTVVLLLVC LSLLIGLRHE
RRLIKLTQQA
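
The relationship between the gene and protein sequences can be illustrated with a short translation sketch. Translation table 11 uses the same codon assignments as the standard genetic code but allows alternative start codons such as GTG, which is why the gene begins with GTG while the protein begins with M. The `translate` function below is a hypothetical helper written for this example; it assumes the first codon is an initiator and is always read as methionine, and it translates only the first 60 bases of the gene.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is read as M,
    even when it is an alternative start codon such as GTG.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First 60 bases of the gene sequence from this record.
first_block = (
    "GTGACCGATC CCAACAACAC CGATCCGAAA "
    "GAGGTGAAGA AGGAGATCGA GGAGCTGGAG"
)

print(translate(first_block))
```

The printed residues can be compared with the first two ten-residue blocks of the protein sequence above.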