Gene Sare_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3803 
Symbol 
ID5704554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4332499 
End bp4334094 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content71% 
IMG OID641273225 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_001538587 
Protein GI159039334 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.19587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACATGG GCGGATCGGC TGGGCGGCCC GGCAGCGGGA GCGTCGACCG GTTGCTGCTG 
TGCCTCGCCG TTGGCGGCGT GCTGGCCGTC GTGGCCTGGG GCGTCCTGGA CCGGGAGTCG
GTGTCGGCTG TCGGTCAGAC CGGGCTGAAC TGGGTCATCA CCACGTTCGG CTGGTTGTTC
GTCGTCGCGG CGAACGCCTT TCTGGTGTTG GCCGTGGTGC TGGCACTCTC CCGCTTCGGC
ACCATTCGGC TGGGGCCCGA CGCCGAGCGG CCGGAGTTCA GCACGTTGGC CTGGGTCGCC
ATGATGTTCA GCGCAGGCAT GGGGATCGGC CTGGTCTTCT TCGCCGTGGC CGAGCCGATC
CAGCACTACG CGTCGCCGCC GCCGGCGACC GGGGTCGAGC CGGAGACCGG TGCCGCCGCC
TCGGCTGCCA TGCAGTTCAC CCTGTTCCAC TGGACGCTGC ACCCGTGGGC GATCTACGCG
GTGGTGGCGC TCGCCCTGGC GTACTCGACC TTCCGCAAGG GGCGGGAGAA CCGGATCTCG
GCCGTGTTCC GTCCGGTGCT CGGCGACCGG GCGGACGGCG CGGCCGGACG GGTGATCGAC
CTGCTGGCGG TCTTCGCCAC GGTCTTCGGC ACAGCGACCA GCCTCGGGCT CGGCGCGCTC
CAGGTCACCG CGGGCCTGGA CCGGGTCGCC GGGATTCCCG ACAGCACCAC GGCGGAGCTG
GTGGTGATCG GGGCGTTGAC CCTGGCCTTC GTCGTCTCGG CCTTCTCCGG GCTGTACCGG
GGCATCAAGT GGCTGTCCAC CACCAACGTG GTGCTGGCGG TGCTGCTGAT GCTGTTCGTC
TTCGTGGTCG GCCCGACGGT CTACGTCCTG GATGTGCTGC CCGCCTCGAT CGGCGACTAC
GTCAGCAACC TGGTCTTCAT GTCGACGCGG ACCGGGGCCT TCTCCGACCC GTCCTGGTTG
GGCTCCTGGA CGATCTTCTA CTGGGCGTGG TGGATCTCCT GGGCCCCGTT CGTCGGTACC
TTCATCGCCC GCATCTCCCG TGGTCGTACG GTGCGCCAGT TTCTGGTCGG CGTGCTGCTG
GTGCCCAGCG GGGCCAGTGT GGTCTGGTTC GCGGTGATGG GCGGCAGCGC GCTGCGGGTG
CAGGCCACCG GCACCCGGGA CCTGGTCGCC GAGGCCGCCG CCGGCGCCGA CCAGGCACTC
TTCGGGTTGC TCGACGCGTT GCCGCTGGGC GCGCTGACCA GTGTGCTGGC CATGGCGCTG
GTGATGCTCT ACTTCGTCAC CAGTGCCGAC TCCGCCTCCC TCGTGCTCGC GTCGCTGACC
TCCCGGGGCG CGTTGCGTCC GCGCCGGTTG CTCGTCGTCA CCTGGGGTGT GTTGATCGGT
GGGACCGCCG CGGTGCTGCT GCTGGCCGGC GGGCTGAACG CGCTCCAGCA GGCGACGATC
ATGGTCGCGT TGCCGTTCGT GGTGGTGATG CTCGGCCTGG CCGTGTCGTT GGTCAAGGAG
ATGTCCCAGG ACCCGGCGGT GCGGGTCCCC CCGCCCCAAC CGCACGGGCT GGCCGCCGCC
CTCCACCGGG CCCGCTCGAC CGAGGAGGAA CACTAG
 
Protein sequence
MDMGGSAGRP GSGSVDRLLL CLAVGGVLAV VAWGVLDRES VSAVGQTGLN WVITTFGWLF 
VVAANAFLVL AVVLALSRFG TIRLGPDAER PEFSTLAWVA MMFSAGMGIG LVFFAVAEPI
QHYASPPPAT GVEPETGAAA SAAMQFTLFH WTLHPWAIYA VVALALAYST FRKGRENRIS
AVFRPVLGDR ADGAAGRVID LLAVFATVFG TATSLGLGAL QVTAGLDRVA GIPDSTTAEL
VVIGALTLAF VVSAFSGLYR GIKWLSTTNV VLAVLLMLFV FVVGPTVYVL DVLPASIGDY
VSNLVFMSTR TGAFSDPSWL GSWTIFYWAW WISWAPFVGT FIARISRGRT VRQFLVGVLL
VPSGASVVWF AVMGGSALRV QATGTRDLVA EAAAGADQAL FGLLDALPLG ALTSVLAMAL
VMLYFVTSAD SASLVLASLT SRGALRPRRL LVVTWGVLIG GTAAVLLLAG GLNALQQATI
MVALPFVVVM LGLAVSLVKE MSQDPAVRVP PPQPHGLAAA LHRARSTEEE H