Gene Hhal_1851 details


Gene Information

Locus tag: Hhal_1851
Symbol:
ID: 4711268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2022401
End bp: 2024032
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 66%
IMG OID: 639856323
Product: choline/carnitine/betaine transporter
Protein accession: YP_001003417
Protein GI: 121998630
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
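
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2022401
END_BP = 2024032
GENE_LENGTH_BP = 1632
PROTEIN_LENGTH_AA = 543


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(n_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 2024032 - 2022401 + 1 = 1632 bp, and 1632 / 3 - 1 = 543 aa.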


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.993809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAACG TAGCGACCCG CGGTTTCTTC AGAGGCATGA GCCCCCGGGT GACCGCCATC 
TCGACCTTCC TGGTAGCGGC TTTCGCCCTG GCCGGAGCCA TCTGGCCGAA ACACCTGGAG
GCGGTCGTCA CCGGGTGGCG CGAATCGCTG ACCCCGTTCC TGCAGTGGTA CTACGTGCTG
GTGGTGGCCG CCTTCCTGCT ACTGGTGATC TGGCTGGGCA CGGGGCGGTT CAAGAATGTG
CGTCTGGGCC AGGATCACGA GGTGCCGGAG TTCCGTACCT TCTCCTGGCT GACCATGCTG
TTTGCCGCAG GCATGGGGGT GGGCCTGATC TTCTGGGCCG TCGCCGAACC CATCTCCCAC
TTCGACAGCA ACCCGTTCAC AGTCTCCGGC GACACCACCG AAGCCGCCGA CACGGCGCTG
CGCCTGGCCT ACTTTCACTG GGGCCTCAAC GGCTGGGCCG TCTTCTCCCT GGTGGCGCTG
ATCCTCGCCT ACTTCAGCTT CCGCCGCGGC CTGCCGCTGA CCATGCGCTC GGCCTTCTAC
CCGCTGATCG GTAAGCACAT CCACGGGCCT TGGGGCGATG CCGTGGACAT CCTGGCGGTG
CTGGCCACCG TCTTCGGCAT CGCCACCACC CTGGGGCTCG GCATCCAGCA GCTGAACACC
GGCATCGGCG AACTTACCGG GATCACGGCG GGCACCACCG GCCAAATCGC CATCGCTATC
ACCGTGATGG GGATCGCCAC CATCTCGGTG CTCTACGGCG TGCAGTCCGG GGTCCGCCTG
ATCAGCGAGG CCAACTTCTG GATGAGCGCG GCAGTGCTGC TCTTCTTCCT GCTCTGGGGC
CCCACCCAGT ACCTGCTGGC GCTGATCGTG CAATCCACCG GCGACTACCT GCAGAACCTG
TTTACCCTCT CGTTCCACAC CCATGCCAAC GCCCTTGGCG ACTGGCAGGC GGAATGGACC
CTCTTCTACT GGGGGTGGTG GCTGGCCTGG GCCCCCTTCG TCGGGATCTT CATCGCCCGC
ATCTCGCGGG GGCGCAAGCT GCGCGAGTTC GTCATGGGCG TGCTGCTGGT GCCCACCGGC
ATCACCATCG TCTGGATCGG GCTATTCGGC GGCAACGCCA TCCACATCGA GCTCTTCGGC
CCCGGCGGGG TCGTCGACGC CACCCGCGAG GAGGTCAGCA CCGCGGTCTT CCGAACCATC
GAGTTGATGG ACGTCGGCAT CTGGGCCACG GCGGCCTCGA TCCTCGTCAC CGTGCTCATA
GCCACCTATC TGATCACTTC CGCCAACGCC GGCATCCTGG TCACCCAGAC CCTGCTGTCC
AACGGTTCGA CGGAGATCTC CCGGCTGCAC ACCGTGATTT GGGGGACCGT CATCACCCTG
GTGACCATCG TGCTGCTGAC CGCCGGGGGC CTGACTACCC TCCAGGGTGC GGTGATCGCC
GCGGCAGTGC CCTTCTCCTT CATCATCATC GGCATGGTGG TGGGACTGCT CAAGGCCCTG
GAGCAGGAGG CCTTCGCCCC GCGGCCGGGC GAGCGCAGCG GCGCACCCAT GGAGCCCTGG
GCCCAGGTCG AGTCGGACTG GCACACCAGC GAAACCCACA CCGACACGGC CACCGACCGG
ACGGAGGACT GA
 
Protein sequence
MFNVATRGFF RGMSPRVTAI STFLVAAFAL AGAIWPKHLE AVVTGWRESL TPFLQWYYVL 
VVAAFLLLVI WLGTGRFKNV RLGQDHEVPE FRTFSWLTML FAAGMGVGLI FWAVAEPISH
FDSNPFTVSG DTTEAADTAL RLAYFHWGLN GWAVFSLVAL ILAYFSFRRG LPLTMRSAFY
PLIGKHIHGP WGDAVDILAV LATVFGIATT LGLGIQQLNT GIGELTGITA GTTGQIAIAI
TVMGIATISV LYGVQSGVRL ISEANFWMSA AVLLFFLLWG PTQYLLALIV QSTGDYLQNL
FTLSFHTHAN ALGDWQAEWT LFYWGWWLAW APFVGIFIAR ISRGRKLREF VMGVLLVPTG
ITIVWIGLFG GNAIHIELFG PGGVVDATRE EVSTAVFRTI ELMDVGIWAT AASILVTVLI
ATYLITSANA GILVTQTLLS NGSTEISRLH TVIWGTVITL VTIVLLTAGG LTTLQGAVIA
AAVPFSFIII GMVVGLLKAL EQEAFAPRPG ERSGAPMEPW AQVESDWHTS ETHTDTATDR
TED
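
As a rough sketch of how the sequence blocks relate to the GC content and translation table fields, the Python below strips the 10-character grouping, computes a GC fraction, and translates a coding sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code, whose elongation codons are shared by translation table 11; the special handling of alternative start codons in table 11 is not implemented, which does not matter for a gene that begins with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGTTCAACG TAGCGACCCG CGGTTTCTTC "
                  "AGAGGCATGA GCCCCCGGGT GACCGCCATC")
    print(translate(first_line))
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the listed protein sequence and the GC content field.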