Gene Plav_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0421 
Symbol 
ID5454269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp452686 
End bp454671 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content61% 
IMG OID640875987 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_001411701 
Protein GI154250877 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.45567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAG AAAAACCGGG CGCGGAAAAA TCCCGTCTCT TTCAGGCAAA CGCGCCGGTC 
TTTTTCGGGT CCGTCATCCT CGTTCTGGCG GCGCTTGTCT TCTCCGTCCG CGATCCGGCG
AATGCCGCCA ACATTTTCGC TTCGGTTCAG TCATGGATCA TCCATGAGAT GGGCTGGTTC
TATGTCGCCT CCGTCGCGAC ATTCCTGATT TTCTCCGTCG GCGTCGCCGC GAGTTCCATG
GGTGCGATCA AGCTCGGCCC GGATGACTCC GAACCCGATT TCTCGACGGC GTCCTGGTTC
GCCATGTTGT TCAGCGCCGG CATGGGCATC GGGATCATGT TCTACGGCGT CGCGGAGCCG
GTGCTGCATT TCGCCAGCCC GCCGGTGGGC GAGGGGGAGA CGCTGGACGC CGCGCGCAAT
GCAATGCAAC TCGCCTTCTT TCACTGGGGG CTTCACGCCT GGGCGATCTA TGCCGTGATG
GGTATGGCGC TCGCCTATTT TTCCTTCCGT CACGGACTGC CGCTCACGGT GCGCTCGGCG
CTCTATCCGC TGATCGGCGA GCGTATCCAC GGCTGGATAG GGCACCTCGT CGACATCTTC
GCCGTCTTCG GCACGATGTT CGGCGTCGCT ACCTCGCTCG GCCTTGGCGT GATGCAGGTA
AATGCCGGCC TCAACTACCT GTTCGACATC GAGATGGCGC TCGGCGTTCA GCTCGCCCTG
ATCGCAGGCA TAACGCTTCT GGCGACCGCC TCCGTCGCTT CCGGTATCAA TGCGGGCATT
CGCCGCCTCT CTGAGCTCAA TCTCGGCCTG GCGCTGCTGC TCATGCTTTT CGTGCTGTTT
GTGGGGCCGA CTGTCTTCCT GCTGCAGGCG GCCATGCAGA ATACAGGCGG CTATTTGAGC
GACATCATCG GCAAGACATT CACGCTCTAC GCCTATCAGC CGAATGAATG GATCGGAAAC
TGGACCCTCT TTTACTGGGG ATGGTGGATT TCATGGTCTC CCTTCGTCGG CATGTTCATC
GCGCGCATTT CGCGCGGCCG CACAATCCGC CAATTTGTTG TGGGCGTGCT CCTCGTGCCG
TCCGGCTTCA CCTTCCTCTG GTTCACCGTG TTCGGGAATA CGGCGCTCGC TATGCAGCTT
GATGGTTCGG CCGAGATGGT CGGCGCCGTC CAGGCCGATG TCGCGGTCGC GCTTTTTCAG
TTCCTCGAGC ACCTTCCCCT TGCCGGTATC TCGATGTCGC TCGCAACGTT GCTTGTCGTG
ACCTTCTTCG TCACATCGTC GGATTCCGGG AGCCTCGTCA TCGACATCAT CACCTCGGGC
GGTAAAGCGG AGCCTCCGGT CTGGCAGCGC GTATTCTGGG CCTTGATGGA GGGCGTGGTT
GCGGCCGTGC TGCTGCTGGC GGGCGGTCTG GCCGCGTTGC AGACAGGCGC CATCGCCAGT
GGCTTTCCGT TGGCCGCCAT TCTGCTCATC GTCTGCTACG GCCTCTTCAC CGCTCTCCGC
CGGGAGACGC AACGCCAGAA GAGCCTTCAA TTCGGCATTC CGATTGTCGC GAGCCATCCG
CCCCTCGCCT GGAAACAGCG TCTCGGGACG CTTCTTCATC AGCCGACGCG CGAGCGGGCA
TCGGCTTTCA TGCGCGATGT CGTGGGCCCC GCGCTCAATG AGGTCGCGGT GGAAATTCGT
CATCGCGGTC TTGAGGTGGA TGTCACCGCA ACCGAGAGGA AGACACAACT GTCGGTCGGG
CATGGCGCCA GCGACGATTT CCTCTATGGC GTCGCGCTTC GGAGCGTACC GGTCCCAAGC
TTTGCCATCA CTGCGCTTGA AGGTGAGCGC GAAGGGCCGG ATCACACCTG GCGCGCGGAA
GTCGCCCTGC GTGAGGGCGG TCAACGCTAC GATATTTTGG GTTACACCAA GGAACAGGTG
ATCGCGGACC TTCTGGGGCA GTATGAGCGG CACATGCATT ATCTGAATCT GCACCGGGCC
GGATAG
 
Protein sequence
MAEEKPGAEK SRLFQANAPV FFGSVILVLA ALVFSVRDPA NAANIFASVQ SWIIHEMGWF 
YVASVATFLI FSVGVAASSM GAIKLGPDDS EPDFSTASWF AMLFSAGMGI GIMFYGVAEP
VLHFASPPVG EGETLDAARN AMQLAFFHWG LHAWAIYAVM GMALAYFSFR HGLPLTVRSA
LYPLIGERIH GWIGHLVDIF AVFGTMFGVA TSLGLGVMQV NAGLNYLFDI EMALGVQLAL
IAGITLLATA SVASGINAGI RRLSELNLGL ALLLMLFVLF VGPTVFLLQA AMQNTGGYLS
DIIGKTFTLY AYQPNEWIGN WTLFYWGWWI SWSPFVGMFI ARISRGRTIR QFVVGVLLVP
SGFTFLWFTV FGNTALAMQL DGSAEMVGAV QADVAVALFQ FLEHLPLAGI SMSLATLLVV
TFFVTSSDSG SLVIDIITSG GKAEPPVWQR VFWALMEGVV AAVLLLAGGL AALQTGAIAS
GFPLAAILLI VCYGLFTALR RETQRQKSLQ FGIPIVASHP PLAWKQRLGT LLHQPTRERA
SAFMRDVVGP ALNEVAVEIR HRGLEVDVTA TERKTQLSVG HGASDDFLYG VALRSVPVPS
FAITALEGER EGPDHTWRAE VALREGGQRY DILGYTKEQV IADLLGQYER HMHYLNLHRA
G