Gene Csal_0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0654 
Symbol 
ID4026339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp732371 
End bp734005 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content62% 
IMG OID637965824 
Productcholine/carnitine/betaine transport 
Protein accessionYP_572714 
Protein GI92112786 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAACA ATGGAAGGCA ACGCGTATTT ATCGTCTCCG CCTTGATCGT CGCGGTTCTC 
GTCGCCATCG GGGCGGCGTT TCCCGAGCGT TTCGGCAGCG CGGCGTCGTC GGCATTATCC
GGCATTTCGC ACTATTTCGG GTGGTTCTAC CTGTTTTCGG TGTTCGGGTT CGTGGTGTTC
TTGCTGACGC TGGCGTGCAG CAAGTATGGC AAGATTCGGC TCGGCCCCCA GGACAGTTCG
CCCTCCTACA GTTTCTTCTC GTGGGTCAGC ATGCTGCTGG CGGCGGGGTT CGGCGTCGGC
CTGGTGTTCT ACGGCATGGC CGAGCCGATG ACGCACTTCC TCGAACCGCC CTACGGTGAT
GTGGAAGGCG GTACCGAAGA GGCGGCACGC TATGCCATCC AATACAGCTT CTTCAACTGG
GGCATTCATC AGTGGGCCGC GTTTTCCGTG GTGGGGCTGA TCATTGCCTA TTATCAGTTC
CGCAAGGGCC AGGCGGGGCT GGTCTCCAAC GTGCTTTCGA GCATGACCGC CAAGCGTCCC
AAGATGCGCA AGCTGGGACC GGCGCTGGAT GTCTTTGCCG TGGTCGCGAC GGTGATGGGG
GTGGCGACCT CCATCGGTCT GGCGGTGCTG CAGATCAACG GCGGCCTGCA TGCGGTCTTC
GGTGTCGAAG AGGGCATGAC GTGGCAGTTC ATCATCATGG GGGCGATGTT TTTGTGTTAC
ATGGCCTCCA CCTGGTCGGG GCTGGACAAG GGCATCAAGC GCCTTTCCAA CCTCAACATG
GCGCTGTGCT TCGCGTTGAT GTTCTACGTG CTGTTCACGG GCCCCACCGT GGCCATTCTC
GAGACCATCA CCCTGGGGAT CGGCGATTAC CTGCAGAACA TCGTGGGCAT GAGCCTGCGG
GTCGCGCCGT ATAGCGACAA CACCTGGGCC AGCAACTGGA CGATCTTCTA TTGGGCCTGG
GTCATCGCCT GGTCGCCGTT CGTGGGCACC TTCGTGGCGC GCGTCTCGCG TGGGCGCACC
ATCAAGGAGT ACGTGTTCGG CGTGTTGATC GTGCCGCCGC TGCTGGCCTG CCTGTGGATC
GGGGTCTTCG GCGGCGCGGC GCTCAACATG GAGCTCACCG GCGACGTGGG ACTGGCCTCG
GCCACGGCAG ACAACATCAC GGTGGCGCTG TTCCGGATGT TCGAGCTGAT GCCGTTCTCC
AATGTGCTGT CGGTGGTGGC GCTGTCGCTG ATCTTCATTT TCCTGGTGAC CTCGGCGGAC
TCGGCGACCT ATATCGTGTC GCAGATGACC GATGGCGGTT CGCTGAATCC GCCGCTGTTC
AAGCGGGTGA TCTGGGGGGT ACTGATCGCG GCGATCTGTC TGACCCTGCT GATTGCCGGC
GGGTTGAATG GCCTGCAATC GGCGGCGGTG CTGGCGGCGT TACCCTTCAC CTTCATCCTG
TACGGCATGA TTGCCGTGCT GGTGAAGGAA TTGCGCGCCG ATCGCAAGGC GATGCTGACA
TCGCTTTATC ATCGTCATGG GGAAACGCCG GTAGGCGCCG ATGCCTTCGA GGCGGAAACG
CTGGCGGAAG CCGAGCGGTA CCGGCGTGCA CCGAACGTGG TCAACCGGCG CATCAATACG
CGCGACGGTA CCTGA
 
Protein sequence
MANNGRQRVF IVSALIVAVL VAIGAAFPER FGSAASSALS GISHYFGWFY LFSVFGFVVF 
LLTLACSKYG KIRLGPQDSS PSYSFFSWVS MLLAAGFGVG LVFYGMAEPM THFLEPPYGD
VEGGTEEAAR YAIQYSFFNW GIHQWAAFSV VGLIIAYYQF RKGQAGLVSN VLSSMTAKRP
KMRKLGPALD VFAVVATVMG VATSIGLAVL QINGGLHAVF GVEEGMTWQF IIMGAMFLCY
MASTWSGLDK GIKRLSNLNM ALCFALMFYV LFTGPTVAIL ETITLGIGDY LQNIVGMSLR
VAPYSDNTWA SNWTIFYWAW VIAWSPFVGT FVARVSRGRT IKEYVFGVLI VPPLLACLWI
GVFGGAALNM ELTGDVGLAS ATADNITVAL FRMFELMPFS NVLSVVALSL IFIFLVTSAD
SATYIVSQMT DGGSLNPPLF KRVIWGVLIA AICLTLLIAG GLNGLQSAAV LAALPFTFIL
YGMIAVLVKE LRADRKAMLT SLYHRHGETP VGADAFEAET LAEAERYRRA PNVVNRRINT
RDGT