Gene Plav_2603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2603 
Symbol 
ID5455797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2806469 
End bp2808526 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content60% 
IMG OID640878180 
Productcholine transport protein BetT 
Protein accessionYP_001413868 
Protein GI154253044 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.158704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATG TTACGGACGG CGCCGGAACG AAGGACAGAG AAGACCGGAT CAATCCGGCC 
GTCTTCTACA CATCCGCCGT CGGCATCGCG CTTTTTGCCA TCTGGACGAT GTTTTTCATC
GATTCCGCGA ATTTCGTCAT CAATTCAGTG CTGGCGTGGA TTTCCGATGT GTTCGGCTGG
TTCTACTTCG TTGCCGTGGT TCTCTATCTG GTTTTTGTGA TCGGCATCGG ACTATCCCGC
TTCGGCAAGG TACGTCTCGG TCCCGACCAT GCCAGGCCGG AATTCAACGC GATCACCTGG
GCGGCAATGC TTTTCGCGGC GGGTATCGGT ATTGATCTCC TCTTCTTCTG CGTCTCCGAA
CCTGTCACGC AGTTCCTTGC GCCTCCCGAG GGCGAGGGCA GCACGGTCGA GGCGGCGCGG
CACGCGATGC AGCTCACCTT TCTCCATTGG GGCTTGTCCG GGTGGGGCGT TTACACGCTG
GTCGGGATGT CGCTTGCCTA TTTCAGCTAT CGGCACGGCC TGCCGCTGAC CATTCGTTCA
GCGCTCTTTC CCATTTTCGG CCAGCGCATC TATGGCGTGA TCGGCCATAC GGTCGATATC
GCCGCCGTTA TCGGGACCGT GTTCGGCATC GCGACGAGCC TCGGCATCGG CATCATCCAG
CTCAACTACG GGCTGGAGCA TATGTTCGGT ATTCCGCAGA GCACCCTTAC CCAAGGCGCC
CTGGTGATCC TGATCATCGT TTTCGCCGCG CTTTCGGCCG CGACGGGTGT GGAGCGGGGT
ATTCGCCGCC TTTCGGAATT CAACATGGCG CTGGCGCTGC TCCTGTTGCT TTTCGTGCTG
TTCGTGGGCG AGACGACCTT CCTGCTCAAT GCGCTGGTGA TGAATATCGG GGATTATCTT
TCCGACTTCG TCAGCCTTTC CTTCAACACC TATGCGTTTG ACCCGCCTGT CGACTGGCTG
AATGCGTGGA CGGTCTTTTT CTGGGCGTGG TGGATTGCAT GGGGACCCTT TGTCGGGCTC
TTTCTTGCCC GCATTTCGCG CGGCCGCACC ATTCGCCAAT TTGTGGCCGG CACGCTGATC
CTGCCGCTCG TCTTCATGAT GGGGTGGATG TCGATCATGG GGAACAGCGC CATCGAACTT
GTGATGTCCG GTGCGACAGA GTTCGGTGAT GAGGCGATGG CCAATCCAGG TTCCGCGATC
TACCTGTTCA TGCAGAGCCT GCCATTAGCC ACGGTTACGA CCATCGTCGT TACGCTTCTC
GGCATCGTCT TTTTCATCAC TTCGGGTGAC TCAGGCTCGC TGGTGCTTTC GAACTTCACG
TCGACCCTCA AGGACGTCAA TTCCGATGCG CCGGTCTGGA TGCGGGTTCT GTGGGCGACC
ATTATCGGCG TGCTGACGCT GGCGCTGCTC CTTGCAGGGG GGCTTGAGGC GCTGCAGAGC
ACGGTCGTCA TCATGGGGCT GCCATTCTCC ATCGTGCTGT TCTTGATGAT GCTGGGCCTC
TTCAGGGCGC TCAGGGTCGA GGGCATGAAG GAAGACAGCC ATCGCGCCAG TCTTTCCGGC
TACCTCTCCG GCAGGATCGG CGCGCCTGCG TCCAACTGGC GGCAGCGGAT TGCGCGGGCG
ACGAGCTTCC CGACGTTGGC GCAGGTACGG CGCTTCATGC GGGACGCCGT GAGGCCGGCG
ATGGAAGAGA TCCGGGAAAC GCTTGAAAAG CGGGGATTTC CGGCCCGGAT CGTCGAGGGC
GAGGGGGACG ATGGCTGCCT GTCGCTCAAT GTCGCAATGG CCGACGACCA GGACTTCACC
TACGAGGTCT GGCCGGTATG CGGCACGATG CCTGCCTTCG CGGTACGGCC GCAACAGACG
GCTTCGGAGT ATTACCGCGC CGAGGTGCAT CTCTTCGAGG GCAGCCAGGG CTACGACCTG
ATGGGCTACA CGAAGGAGCA AGTGATCGAG GACATCCTCG ATCAGTTCGA GCGGCACATG
CATTTCCTTC ATACGCAGCG CGAGGCGCCG GGCGGGGCAA CCCGGATGCC GGACGACAGC
ACCAAGGGGC CGGACTGA
 
Protein sequence
MSDVTDGAGT KDREDRINPA VFYTSAVGIA LFAIWTMFFI DSANFVINSV LAWISDVFGW 
FYFVAVVLYL VFVIGIGLSR FGKVRLGPDH ARPEFNAITW AAMLFAAGIG IDLLFFCVSE
PVTQFLAPPE GEGSTVEAAR HAMQLTFLHW GLSGWGVYTL VGMSLAYFSY RHGLPLTIRS
ALFPIFGQRI YGVIGHTVDI AAVIGTVFGI ATSLGIGIIQ LNYGLEHMFG IPQSTLTQGA
LVILIIVFAA LSAATGVERG IRRLSEFNMA LALLLLLFVL FVGETTFLLN ALVMNIGDYL
SDFVSLSFNT YAFDPPVDWL NAWTVFFWAW WIAWGPFVGL FLARISRGRT IRQFVAGTLI
LPLVFMMGWM SIMGNSAIEL VMSGATEFGD EAMANPGSAI YLFMQSLPLA TVTTIVVTLL
GIVFFITSGD SGSLVLSNFT STLKDVNSDA PVWMRVLWAT IIGVLTLALL LAGGLEALQS
TVVIMGLPFS IVLFLMMLGL FRALRVEGMK EDSHRASLSG YLSGRIGAPA SNWRQRIARA
TSFPTLAQVR RFMRDAVRPA MEEIRETLEK RGFPARIVEG EGDDGCLSLN VAMADDQDFT
YEVWPVCGTM PAFAVRPQQT ASEYYRAEVH LFEGSQGYDL MGYTKEQVIE DILDQFERHM
HFLHTQREAP GGATRMPDDS TKGPD