Gene Hhal_1384 details


Gene Information

Locus tag: Hhal_1384
Symbol:
ID: 4711332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1491265
End bp: 1492839
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 65%
IMG OID: 639855851
Product: choline/carnitine/betaine transporter
Protein accession: YP_001002953
Protein GI: 121998166
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
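
The start and end coordinates in this record agree with the reported gene and protein lengths, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon that is not counted in the protein length. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1491265, 1492839)
print(length, protein_length(length))  # 1575 524
```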


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.095402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGTGCAC AGAAAGGCCC GCTGAAAGGG CTGAATATTC CGTTGACGGG CACGGCCACG 
CTGATCGTGC TCGCTTTCCT CATCTTCGGT GCCTGGGATC CGGAATACGC CGAGACGGTC
TTCGAGGGCA TCAGCGGCTG GGTCATCGAG ACCTTCAAGT GGTACTACAT CGGGGTCGTC
GCGTTCTTCC TGCTCTTCGC GCTGTTCCTG ATGTTCAGCC GCTTCGGTGA CCTCAAGCTT
GGGGACGACG ACCGGCCACC GGAGTTCAGT TACTTCGCCT GGTTCTCCAT GCTCTTCGGT
GCCGGTATGG GCATCGGCCT GTTGTTCTGG AGCATCGCCG AGCCGGTCTG GCACTTCCAG
GGCAACCCCT TCATCGACGA GGGTGAGACG GCCGCCGCGG CTGACTCGGC CATGCGCCTG
ACCTACTTCC ACTGGGGTAT GCACCCGTGG GCCATCTACG CCATCGTGGC GCTTTCGCTG
GCCTTCTTCT GCTACCGCAA GAAGCTGCCG CTGGCCATCC GTTCGGCGCT CTATCCGCTC
ATCGGCAACC GCATCTACGG CCCGATCGGC CACGCGGCGG ACGTCCTGGC GGTCTTCGGC
ACCATCTTCG GCGTGGCCAC CTCCCTGGGG TTCGGTGCAA TCCAGATCAA CACCGGCCTC
AACGAACTCA CCGGGCTTGA GCTCAGTGTC ACCAACCAGC TGCTGATCGT CGCCGTGGTC
ACCCTGATTG CCGTGGGTTC GGTGATCTCC GGCGTGGGGC GGGGCGTGAA GGTCCTCTCG
CAGCTGAACC TGATCCTCAG CGCCGTGATC CTGCTCTTCT TCCTGAGCTT CGGGCCGACC
CTTTATCTGC TCTCGAGCTT CGTCCAGGGC ATCGGCGACT ACCTGCAGAA CGTGGTCTAC
CTCAGCTTCT GGACCGACGC CAGCGGGGCC CGTGAGGCCG GCGACTGGCA GCTCTCGTGG
ACGGCCTTCT ATTGGGGCTG GTGGATCGCC TGGGCCCCCT TCGTGGGTAT GTTCATCGCC
CGCATCTCCC GCGGTCGCAC CATCCGCGAG TTTCTGGGCG GCGTGCTGCT GGTGCCGACC
CTGCTCGCCC TGGGCTGGCT GACCGTCTTC GGTGGCACCG GCCTGTACCA GGAGCTCTTC
GGTGCCGGGG GGCTGGTCGA GGCGGTGTCC GAGGATGAGA CCATCGCCCT TTACTACACC
ATCGAAGCGG TGGCCCCCGG GGTGATTGCC ACCATCTTTG CGGCGATCGC CACGGTCCTC
ATCGCCACCT ACTTCATCAC CTCGTCGGAC TCGGCGACGC TGGTGGTGAC CATGCTGCTG
TCTGTCGGCA ACACCGAGCC GCCGACCTAC CAGCGGGCCT TCTGGGGCGT GGCCGAAGGC
TGCGTGGCCG CGGTACTGCT GGTGGCTGGT GGCTTGGTTG CCCTGCAGGC GGCGGCCATC
GTCGCGGCGT TGCCGTTCTC GCTGCTGATG CTGCTGATGT GCTACGCCCT GATCCGCGGG
CTGCAGGAGG AGAAGCGGCG TATGCAGCTC TCCTGGCAGC CGGGGCAGGG GCCGCCGGCG
GCACCGCATC TGTGA
 
Protein sequence
MRAQKGPLKG LNIPLTGTAT LIVLAFLIFG AWDPEYAETV FEGISGWVIE TFKWYYIGVV 
AFFLLFALFL MFSRFGDLKL GDDDRPPEFS YFAWFSMLFG AGMGIGLLFW SIAEPVWHFQ
GNPFIDEGET AAAADSAMRL TYFHWGMHPW AIYAIVALSL AFFCYRKKLP LAIRSALYPL
IGNRIYGPIG HAADVLAVFG TIFGVATSLG FGAIQINTGL NELTGLELSV TNQLLIVAVV
TLIAVGSVIS GVGRGVKVLS QLNLILSAVI LLFFLSFGPT LYLLSSFVQG IGDYLQNVVY
LSFWTDASGA REAGDWQLSW TAFYWGWWIA WAPFVGMFIA RISRGRTIRE FLGGVLLVPT
LLALGWLTVF GGTGLYQELF GAGGLVEAVS EDETIALYYT IEAVAPGVIA TIFAAIATVL
IATYFITSSD SATLVVTMLL SVGNTEPPTY QRAFWGVAEG CVAAVLLVAG GLVALQAAAI
VAALPFSLLM LLMCYALIRG LQEEKRRMQL SWQPGQGPPA APHL
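
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (bacterial, archaeal and plant plastid code) gives when GTG is used as the initiator codon. The sketch below is a small hand-rolled translator written for illustration, not the tool that produced this record; the `translate` function and its `initiator` flag are hypothetical names. It assumes the table 11 amino-acid assignments and start-codon set as listed in the NCBI genetic code tables, and it is checked here only against the first 30 and last 15 bases of the gene sequence.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (same assignments as the
# standard code, which table 11 shares; only the start codons differ).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Alternative initiator codons assumed for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate DNA, ignoring whitespace, up to the first stop codon.

    If `initiator` is true and the first codon is a table 11 start codon,
    it is translated as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then the last 15 bases.
print(translate("GTGCGTGCAC AGAAAGGCCC GCTGAAAGGG"))    # MRAQKGPLKG
print(translate("GCACCGCATC TGTGA", initiator=False))   # APHL
```

Both outputs match the corresponding ends of the protein sequence listed in this record.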