Gene Plav_2604 details


Gene Information

Locus tag: Plav_2604
Symbol:
ID: 5455046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 2808569
End bp: 2810257
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 63%
IMG OID: 640878181
Product: choline dehydrogenase
Protein accession: YP_001413869
Protein GI: 154253045
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01810] choline dehydrogenase
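
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2808569
end_bp = 2810257

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be a stop
# codon and therefore does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1689 562
```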


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.142768
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGACC AATCCTTCGA ACGCGAGGCC GATTATGTGA TTGTCGGCGC CGGATCAGCG 
GGTTGCGTCC TGGCGGACCG TCTCACCGCG GAGGGCAGGC ACAAGGTGCT GGTGCTTGAG
ACGGGCGGAA GGGACAACTC CGTCTATATC AAGATGCCGA CTGCATTTTC CATCCCGCTT
GGCATGAAGA AATACGACTG GGGCATGCAT GCCGAACCGG AGCCGGGGCT CAATGGGCGG
CGGCTGCACC AGGCGCGGGG CAAGGTGATC GGCGGATCGT CGTCGATCAA CGGGCTTGCC
TATGTGCGCG GCTGCGCGGG CGACTTCGAG GAATGGGCGG AGCTGGGCGC GGCGGGATGG
GACTATGCGA GCGTGCTGCC CTATTTCCGG CGTTCCGAGG ACTGCCTCTA TGGCGAGGAT
GCCTATCGCG GCACAGGCGG GCCGGTCGGC ATCACCAATG GCAACAACAT GAAAAACCCC
CTTTACCGCG CCTTTATCGA GGCGGGGCGG CAGGCCGGCT ATGGCATGAC GGAGGATTAT
AACGGCTACC GGCAGGAGGG CTTCGGCCGC ATGGACATGA CCGTGCGGGA CGGGATTCGT
TGTTCTACGG CAGTGGCTTA CCTGAAACCG GCGATGAAGC GCGACAACCT CGAAGTGGAG
ATGCACGCGC TGGCGACGCG CATTCTGATG GAGGGCAAGC GCGCGGTCGG CGTCGAATAC
AGGCGGCGCG GCAAGCTCCA TCGCGTAAAG GCGCGGCGCG AGGTCATCGT CTCGGCGAGC
TCCTTCAACT CGCCGAAACT GCTGATGCTG TCCGGCATCG GCCCGGCGGC GCATCTGAAG
GAACACGGAA TTCCGGTCAT CCACGATCTT CCGGGGGTTG GGGATAACTT GCAGGACCAT
CTGGAGGTCT GGGTGCAGCA AACCTGCACG CAGCCGATCA CGCTGAACGG GACGCTGGGG
CCCATTTCGA AGCTGCTGAT CGGCATGGAA TGGTTCTTCC TCAAGCGCGG CCTCGGCATT
TCCAACCAGT TCGAGTCGAA CGGCTATATC AGGAGCCGCG CGGGCCTCAA GTATCCGGAT
TTGCAGTATC ATTTCCTTGC CGGCGCCATC GCTTATGACG GCTCCAGCGC GGCGGAGGGA
CATGGATTCC AGGTCCATCT GGGCGCCAAC AAGCCGAAAA GCCGCGGCCG GGTGAGCCTC
AACTCGGCCG ACCCCGAAGC ACCGCCAAAG CTCGTTTTCA ACTACCTGAC GGAAGAGGCG
GACAAGCAGG CCTATCGCGA CGGATTGCGG CTGACGCGCG AGATTTTCGC GCAAAAGGCC
TTCGATCCCT ATCGGGGGGA CGAGATATCC CCGGGGCCGA AAGTGCGGAC CGATGCGGAA
ATCGACCAGT GGGTGGCGGA AACGGCGGAG ACCGCCTATC ACCCCGCGGG CACCTGCCGG
ATGGGCGCGG ACGGCATGGC GGTGGTGGAC AGCGAGTGCC GGGTGCATGG CATCGAGGCG
CTGAGGGTGG TCGACTCATC CATCATGCCG ACGCTGCCGA ACGGCAACAT CAATGCCCCG
ACGATCATGA TCGGCGAGAA GGCGGCGGAC CATATTCTCG GAAAGCCGCT CCTGCCGGCT
TCGACGGCGG ACGCTTATCA CGCGCCGAAT TGGCAGGAAA ATCAAAGGGT TGGCAAGCCG
GAGCGATAG
 
Protein sequence
MRDQSFEREA DYVIVGAGSA GCVLADRLTA EGRHKVLVLE TGGRDNSVYI KMPTAFSIPL 
GMKKYDWGMH AEPEPGLNGR RLHQARGKVI GGSSSINGLA YVRGCAGDFE EWAELGAAGW
DYASVLPYFR RSEDCLYGED AYRGTGGPVG ITNGNNMKNP LYRAFIEAGR QAGYGMTEDY
NGYRQEGFGR MDMTVRDGIR CSTAVAYLKP AMKRDNLEVE MHALATRILM EGKRAVGVEY
RRRGKLHRVK ARREVIVSAS SFNSPKLLML SGIGPAAHLK EHGIPVIHDL PGVGDNLQDH
LEVWVQQTCT QPITLNGTLG PISKLLIGME WFFLKRGLGI SNQFESNGYI RSRAGLKYPD
LQYHFLAGAI AYDGSSAAEG HGFQVHLGAN KPKSRGRVSL NSADPEAPPK LVFNYLTEEA
DKQAYRDGLR LTREIFAQKA FDPYRGDEIS PGPKVRTDAE IDQWVAETAE TAYHPAGTCR
MGADGMAVVD SECRVHGIEA LRVVDSSIMP TLPNGNINAP TIMIGEKAAD HILGKPLLPA
STADAYHAPN WQENQRVGKP ER
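
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not part of the record, of how they could be cleaned up to recompute the GC content and check the translation. The function names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical. The codon assignments are those of the standard code, which translation table 11 shares; the special handling of alternative start codons in table 11 is omitted, which is assumed to be harmless here because the gene starts with ATG.

```python
# Codon table built from the NCBI-style amino acid string (base order TCAG).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed above.
first_line = clean_sequence(
    "ATGAGGGACC AATCCTTCGA ACGCGAGGCC GATTATGTGA TTGTCGGCGC CGGATCAGCG"
)
print(translate(first_line))  # MRDQSFEREADYVIVGAGSA
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` would allow the listed GC content and protein sequence to be checked against the record.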