Gene Csal_0223 details


Gene Information

Locus tag: Csal_0223
Symbol:
ID: 4027306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 249948
End bp: 252008
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 62%
IMG OID: 637965374
Product: choline transport protein BetT
Protein accession: YP_572286
Protein GI: 92112358
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
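
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and used for illustration only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(249948, 252008)
print(length)                     # 2061
print(protein_length_aa(length))  # 686
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.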


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGACAC AATCCAATAA CGAGGCACCG ACACGCGATC GCCTCAATGG GCCGGTGTTC 
TATAGCTCGG TGATCGGCAT CGTCGCCTTC TCGCTGTGGA CCATGGTCGC TACCGACCAG
GCCAATACCA TCATCAATGC CATACTCGGC TGGATCTCCA ACACCTTCGG CTGGTACTAC
TTCCTGACCG TCGTGATCTA CCTGGTCTTC GTGATCTATC TGGGGGTCTC GCGCTACGGC
AGCATCCGAC TGGGGCCAGA GCATTCGCGC CCCGACTTCA ATATCTTCTC GTGGTCGGCG
ATGCTGTTCT CCGCGGGGAT CGGCATCGGC GTGATCTTCT TCGCGCTGGC CGAGCCGTTG
ACGCAGTTCT ACAACGCGCC CGACGCGCCC GAGGATCAGG TCGCCGCGGC ACGCCACGCC
ATGGAACTGA CCTTTCTGCA CTGGGGGCTT TCCGGCTGGG GCATCTACAC CCTGGTGGGC
ATGGCGCTGG CCTTCTTCAG CTATCGGCAC AACCTGCCGC TGACCATCCG CAGCGCGCTC
TACCCGATCT TCGGCGAGCG GATCTACGGC ATGATCGGCC ACGTGGTCGA CACCGCTGCC
GTGCTGGGCA CGGTATTCGG CATCGCGGCC AGCCTGGGGA TCGGGGTCAT CCAGCTCAAC
TTCGGCCTGG ACTACATGTT CGGCATTCCC AAGGGTACCT GGACCCAGGT GGTGCTGGTG
CTGGGCATCG TGCTGTTCGC GACCATTTCC GCCGTGACCG GGGTGGAACG CGGCATCCGC
CGTCTTTCCG AGTTCAATAT CCTGCTCGCG GTCCTGCTGC TGCTGTTCGT GCTGTTTGCC
GGCAAGACGA TCTTTTTGCT CAACGCGCTG GTGATGAACA TCGGCGACTA CCTGACCAAC
TTCGTCAGCC TGTCGTTCAA CACCTATGCC TTCGACCGGC CGACCGGCTG GTTGAACGGC
TGGACGCTGT TCTTCTGGGC CTGGTGGATC GCGTGGGGGC CGTTCGTCGG GCTGTTCCTG
GCGCGCATCT CGCGGGGCCG GACGATCCGC ACCTTCGTGC TCGGCACCAT GACGCTGCCG
ATCATCTTCA TGTTCCTGTG GATGTCGCTG CTGGGTAACA GTGCCATCGA CATGGCGATG
AACGGCGCCA GCGAGTTCGG CGAGCAGGTG ATGAACAACC CGCCGGCGGG GATCTATCTG
TTCCTCGAAT CGTATCCGAT GCCGCTTTTG ACCACCGCGG CGGTGAGCAT CCTGGCGATC
GTGTTCTTCA TCACCTCGGG GGATTCCGGG GCGCTGGTTC TCTCGAACTT CACCTCGAAG
CTCAAGAACG TCAACAGCGA TGCGCCGGTC TGGATGCGTA TTCTGTGGTC GGCGGTGATC
GGCATCCTGA CCCTGTCGCT GCTGCTCGCC GGGGGGCTGA CCACCTTGCA GAGCGCGGTG
GTGATCACCG GACTGCCATT CTCGATCGTG CTGTTCTTCA TGATGGCGGG GCTGCTCAAG
GCACTGAAGC TCGAGGCCTT CAAGGAAGAC AGCCGGCGCC TGAGCCTGGC CGGCCAGCTT
TCCGGTCGTA CCGGTGGCGG CGAGCGCGAC TCCCGCAATT GGCAGCAGCG CCTTCGTCGT
GCCATGAGCT TCCCCGGCAA GAAGCAGGCA CGGCGCTTCA TGGAGGAAAC CTGCAAGCCC
GCCATGGAAG CCGTGCGCGA CTCGCTGCAG GAGCAAGGCG TGTCCGTCGA GATCAATCAG
GGCGTGCAGA ACGGCGACGA CTACCTGTCG CTCAACGTCG ATTTCGAGGA CGAGCAGAAC
TTCACCTACC AGGTCTGGAG TCAGGGCTTC TCGACGCCCG GATTCGCGAT GCATGCCCCG
CATGCCGACT CGCGCTACTA CCGGCTGGAG GTCTACCTGC TCGAGGGTAG CCAGGGTTAC
GACCTGATGG GCTACACCCG CGATCAGGTG ATCGGCGACA TCCTCGACCA GTACGAACTG
CACATGCAGT TCCTGCACCT CAACCGGATA GAGCCGGGCA ACATCAACAT GCCCGACAGC
CCGGAACAGC CGCCTTCATA A
 
Protein sequence
MTTQSNNEAP TRDRLNGPVF YSSVIGIVAF SLWTMVATDQ ANTIINAILG WISNTFGWYY 
FLTVVIYLVF VIYLGVSRYG SIRLGPEHSR PDFNIFSWSA MLFSAGIGIG VIFFALAEPL
TQFYNAPDAP EDQVAAARHA MELTFLHWGL SGWGIYTLVG MALAFFSYRH NLPLTIRSAL
YPIFGERIYG MIGHVVDTAA VLGTVFGIAA SLGIGVIQLN FGLDYMFGIP KGTWTQVVLV
LGIVLFATIS AVTGVERGIR RLSEFNILLA VLLLLFVLFA GKTIFLLNAL VMNIGDYLTN
FVSLSFNTYA FDRPTGWLNG WTLFFWAWWI AWGPFVGLFL ARISRGRTIR TFVLGTMTLP
IIFMFLWMSL LGNSAIDMAM NGASEFGEQV MNNPPAGIYL FLESYPMPLL TTAAVSILAI
VFFITSGDSG ALVLSNFTSK LKNVNSDAPV WMRILWSAVI GILTLSLLLA GGLTTLQSAV
VITGLPFSIV LFFMMAGLLK ALKLEAFKED SRRLSLAGQL SGRTGGGERD SRNWQQRLRR
AMSFPGKKQA RRFMEETCKP AMEAVRDSLQ EQGVSVEINQ GVQNGDDYLS LNVDFEDEQN
FTYQVWSQGF STPGFAMHAP HADSRYYRLE VYLLEGSQGY DLMGYTRDQV IGDILDQYEL
HMQFLHLNRI EPGNINMPDS PEQPPS
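
The gene and protein sequences can be compared by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons. Alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACGACAC AATCCAATAA CGAGGCACCG ACACGCGATC GCCTCAATGG GCCGGTGTTC"
print(translate(first_line))  # MTTQSNNEAPTRDRLNGPVF
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole translation to be compared against the listed protein.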