Gene Plav_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2149 
Symbol 
ID5454905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2329216 
End bp2331087 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content64% 
IMG OID640877726 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001413420 
Protein GI154252596 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.276438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCC ACACCCCCCG CAAGGATGAA GCCCTCACCG TCACCACGGG CCCGCTTCCC 
GCCAGCACGA AAATCTTCAC CGCGCCGGAA GGCTTTCCCG GTCTGAAAGT CCCCTTCCGC
GAAATCGCGC TGCATCCTTC CGCCAAGGAG CCGCCGGTGC GCGTCTATGA TACGTCGGGG
CCCTACACGG ACCCGACAGC GCAAATCGAT CTCGAACGGG GCCTGCCGCG CACCCGCGAG
GCCTGGCTCG AAGCGCGCGG CGGCACGGAG CTATACGAAG GCCGCGATGT GAAGCCGGAA
GACAATGGCA ATGTCGGCGA AAAACATCTC GCCCGCGCCT TCCCCGTCCG GAACCTTCCG
CGCCGCGGTC TCCCCGGCCA TCCGGTCACG CAATATGAAT TCGCGAAGGC CGGCATCGTC
ACCGCCGAAA TGGCCTATAT CGCCGAGCGC GAGAATATGG GCCGCAAGCA GGCCGCCGCC
AATGCCGCAC ACAGGATCGC GGAAGGCGAG AGCTTCGGCG CCGATATTCC GGAATTCATC
ACGCCGGAAT TCGTGCGCGA TGAAGTCGCC GCGGGCCGCG CCATCATTCC GTCCAATATC
AATCACCCGG AACTCGAGCC GATGATCATC GGCCGCAATT TCCTCGTGAA GATCAATGCG
AATATCGGCA ACTCCGCCGT CGCCTCGTCG GTCGCGGAAG AAGTCGACAA GATGGTCTGG
GCGATCCGCT GGGGCGCCGA CAATGTCATG GACCTCTCGA CCGGCCGCAA CATCCACAAC
ACGCGCGAAT GGATCATCCG CAATTCGCCG GTGCCCATCG GCACAGTGCC GATCTATCAG
GCGCTGGAAA AGGTCGACGG CATCGCCGAG AACCTCACCT GGGAGGTCTA CCGCGACACG
CTGATCGAGC AGGCGGAGCA GGGCGTCGAT TATTTCACCA TCCATGCGGG TGTCCGCCTC
GCCTATGTGC CGCTCACGGC GAAGCGCGTG ACGGGCATTG TCTCGCGCGG CGGCTCCATC
ATGGCGAAGT GGTGCCTCGC GCATCACAAG GAGAGTTTCC TCTACACCCA CTTCGAGGAA
ATCTGCGACA TCATGCGCCA ATACGATGTG TCGTTCTCGC TGGGCGACGG TTTGCGTCCC
GGCTCCATCG CGGACGCGAA TGACGAGGCG CAATTTGCCG AACTCGAAAC GCTGGGCGAG
CTCACGCAGA TCGCGTGGGC CAAGGGCTGC CAGGTGATGA TCGAAGGCCC CGGTCATGTG
CCGATGCACA AGATCAAGGT CAACATGGAC AAGCAGCTGA AGCATTGCGG CGGCGCGCCC
TTCTATACGC TCGGGCCGCT CACCACCGAC ATCGCGCCGG GCTACGACCA CATCACGTCC
GGCATCGGCG CGGCCATGAT CGGCTGGTTC GGCTGCGCCA TGCTCTGCTA CGTCACGCCG
AAGGAACATC TCGGCCTGCC GGACAGGGCG GACGTGAAGG AAGGCGTCAT CACCTACAAG
ATCGCGGCGC ATGCGGCGGA CCTCGCCAAG GGCCACCCGG CCGCGCAGCT TCGCGACGAC
GCGCTTTCGC GCGCGCGGTT CGAGTTCCGC TGGGAGGACC AGTTCAACCT CGCGCTCGAC
CCCGAACGCG CGAAAGAGTT CCACGACCGC ACGCTGCCGA AGGAAGCGCA CAAGGTCGCG
CATTTCTGCT CCATGTGCGG CCCGAAATTC TGCTCGATGA AAATCACGCA GGAAGTCCGT
GACTATGCGG AAAGCGGCAT GGCCGACATG GCGTCCGAAT TCCGCAATTC CGGCGGCGAG
ATTTATCTCG AAGAAGCGGA CGCGGCGGTG AAGGCATCGA ACAGGGCGCT GGGCGGCAAG
GCGGCGGAGT AG
 
Protein sequence
MNVHTPRKDE ALTVTTGPLP ASTKIFTAPE GFPGLKVPFR EIALHPSAKE PPVRVYDTSG 
PYTDPTAQID LERGLPRTRE AWLEARGGTE LYEGRDVKPE DNGNVGEKHL ARAFPVRNLP
RRGLPGHPVT QYEFAKAGIV TAEMAYIAER ENMGRKQAAA NAAHRIAEGE SFGADIPEFI
TPEFVRDEVA AGRAIIPSNI NHPELEPMII GRNFLVKINA NIGNSAVASS VAEEVDKMVW
AIRWGADNVM DLSTGRNIHN TREWIIRNSP VPIGTVPIYQ ALEKVDGIAE NLTWEVYRDT
LIEQAEQGVD YFTIHAGVRL AYVPLTAKRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFEE
ICDIMRQYDV SFSLGDGLRP GSIADANDEA QFAELETLGE LTQIAWAKGC QVMIEGPGHV
PMHKIKVNMD KQLKHCGGAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP
KEHLGLPDRA DVKEGVITYK IAAHAADLAK GHPAAQLRDD ALSRARFEFR WEDQFNLALD
PERAKEFHDR TLPKEAHKVA HFCSMCGPKF CSMKITQEVR DYAESGMADM ASEFRNSGGE
IYLEEADAAV KASNRALGGK AAE