Gene Plav_2644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2644 
Symbol 
ID5456666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2847636 
End bp2849360 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content59% 
IMG OID640878221 
Productcholine dehydrogenase 
Protein accessionYP_001413909 
Protein GI154253085 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.3131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.995844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCCGC ACAGGCATCA GACGTCGCGG CGGGCGGTAC CGCGAACGGC AAAACAACAA 
ATAACTGGAA ACAAGCGTAT GAGCGATTTC GACTACATCA TCATCGGTGC CGGCAGCGCG
GGATGCGTGC TGGCGAACCG CCTGTCGGAG AACCCGGCGA ACAAGGTGCT GCTGCTCGAA
GCAGGCTCGA AAGATTCCAA TTTCATGATT CACATGCCGG CAGGCGTCGG CAAGCTGATC
GGCACGGATC TCGCCAACTG GTGCTACGAC ACGGAAGGCC AGCCCCACCT GAACAACCGC
AAGCTCTATT GGCCGCGCGG CAAGGTTCTC GGCGGGTCGT CCTCTATCAA CGGCATGATC
TATATTCGCG GTCATGCGCG CGATTACGAC ATGTGGCGTC AGCTTGGTCT GGAAGGGTGG
GGCTTCTCCG ATGTTCTGCC CTATTTCCGC CGGTCGGAGG GCAATGAGAA CGGCAACAGC
GCCTTTCATG GCGGCGAAGG CCCGCTCGGC GTCAGCAATC CGCGCAAGAC CAATGTGCTC
TTCGAGTCCT TTGTCGAAGC GGGCAAGCAG GCGGGGCATC CCTATACGGA AGATTTCAAC
GGGCCGCAGC AGGAAGGCGT CGGTCCTTAC CAGCTCACGA TCAAGAACGG TCAGCGCTGC
AGCGCCGCCA AGGGTTATCT CGTGCCGGCC CTCAACCGTC CGAACCTCAA GATCGAGGTT
GAAGCGCTTA CTTCACGCGT GATCTTCGAA GGCAAGAAGG CAGTCGGCGT CGAATATACG
CAGAAGGGCG AAACGAAAGT CGCACGTGCG GCGAAGGAAA TCGTCGTCTC CGGCGGTGCG
GTCAACACGC CGCAAATCCT CATGCTTTCG GGCATCGGCA AGGGCGAGTA TCTGCGCAAG
TTCGGCCTCG ACGTGGTCGC GGACCTACCG GGCGTCGGCC AGAACCTGCA GGACCATCTT
GATTGCGTCG TCATCAACGA ATGCACGCAG CCGATCACAC TGCACAGCAC GGTCAGCAAT
CCGCTGAAGC AGCTGATGAG CGGCATGCAG TACACCTTCT TCAAAACCGG CCTTGCGACG
TCGAACGGTC TTGAATCCGG CGCTTTCCTG AAGACGCGGC CGGAGCTCGA AATTCCCGAT
ATCCAGCTTC ACTTCGTGGC CGCAATGATG CGCGATCATG CGCGGATAAA ATCTGATCGT
CACGGGTTCA CGGTGCACAT CTGTCAACTT CGACCGGAAA GCCGTGGCTA TATCGGCCTC
AAATCGACCA ACCCGTCCGA TTATGCGCTG ATCCAGCCGA ATTATCTGGC GGCCGAATAC
GACCGCAAGG TGATGCGCGA CGGTGTGAAA ATGGTGCGCA ATATTATTTC GCAGCGCGCG
ATGGACCCCT ATCGCGGGCC GGAGTTCTGG CCGGGTGCGG GCAAGCAGTC GGACGCGGAA
ATCGATGCGT GGATCCGCGA AACCGCGGAG ACAATCTATC ATCCGGTCGG CACCGCCAAG
ATGGGCACGG ACCCGATGGC TGTGGTCGAC GCGAAATGCC GCGTTCATGG GCTCCAAGGG
CTCCGTGTCG TCGATGCCTC CGTGATGCCG ACACTGGTTG GGGGCAACAC CAATGCTCCG
ACGATCATGA TCGCGGAAAA AATTTCCGAT GACATGCTCG GCAAGGCGCC ACTGCCGGCC
GAAAATGTGA CGATTGCGGA AGACCGTATC GGCAACGCAG CCTGA
 
Protein sequence
MLPHRHQTSR RAVPRTAKQQ ITGNKRMSDF DYIIIGAGSA GCVLANRLSE NPANKVLLLE 
AGSKDSNFMI HMPAGVGKLI GTDLANWCYD TEGQPHLNNR KLYWPRGKVL GGSSSINGMI
YIRGHARDYD MWRQLGLEGW GFSDVLPYFR RSEGNENGNS AFHGGEGPLG VSNPRKTNVL
FESFVEAGKQ AGHPYTEDFN GPQQEGVGPY QLTIKNGQRC SAAKGYLVPA LNRPNLKIEV
EALTSRVIFE GKKAVGVEYT QKGETKVARA AKEIVVSGGA VNTPQILMLS GIGKGEYLRK
FGLDVVADLP GVGQNLQDHL DCVVINECTQ PITLHSTVSN PLKQLMSGMQ YTFFKTGLAT
SNGLESGAFL KTRPELEIPD IQLHFVAAMM RDHARIKSDR HGFTVHICQL RPESRGYIGL
KSTNPSDYAL IQPNYLAAEY DRKVMRDGVK MVRNIISQRA MDPYRGPEFW PGAGKQSDAE
IDAWIRETAE TIYHPVGTAK MGTDPMAVVD AKCRVHGLQG LRVVDASVMP TLVGGNTNAP
TIMIAEKISD DMLGKAPLPA ENVTIAEDRI GNAA