Gene Information |
Locus tag | Plav_2149 |
Symbol | |
ID | 5454905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 2329216 |
End bp | 2331087 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640877726 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001413420 |
Protein GI | 154252596 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
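The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2329216
END_BP = 2331087
GENE_LENGTH_BP = 1872
PROTEIN_LENGTH_AA = 623


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```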
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.276438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCC ACACCCCCCG CAAGGATGAA GCCCTCACCG TCACCACGGG CCCGCTTCCC GCCAGCACGA AAATCTTCAC CGCGCCGGAA GGCTTTCCCG GTCTGAAAGT CCCCTTCCGC GAAATCGCGC TGCATCCTTC CGCCAAGGAG CCGCCGGTGC GCGTCTATGA TACGTCGGGG CCCTACACGG ACCCGACAGC GCAAATCGAT CTCGAACGGG GCCTGCCGCG CACCCGCGAG GCCTGGCTCG AAGCGCGCGG CGGCACGGAG CTATACGAAG GCCGCGATGT GAAGCCGGAA GACAATGGCA ATGTCGGCGA AAAACATCTC GCCCGCGCCT TCCCCGTCCG GAACCTTCCG CGCCGCGGTC TCCCCGGCCA TCCGGTCACG CAATATGAAT TCGCGAAGGC CGGCATCGTC ACCGCCGAAA TGGCCTATAT CGCCGAGCGC GAGAATATGG GCCGCAAGCA GGCCGCCGCC AATGCCGCAC ACAGGATCGC GGAAGGCGAG AGCTTCGGCG CCGATATTCC GGAATTCATC ACGCCGGAAT TCGTGCGCGA TGAAGTCGCC GCGGGCCGCG CCATCATTCC GTCCAATATC AATCACCCGG AACTCGAGCC GATGATCATC GGCCGCAATT TCCTCGTGAA GATCAATGCG AATATCGGCA ACTCCGCCGT CGCCTCGTCG GTCGCGGAAG AAGTCGACAA GATGGTCTGG GCGATCCGCT GGGGCGCCGA CAATGTCATG GACCTCTCGA CCGGCCGCAA CATCCACAAC ACGCGCGAAT GGATCATCCG CAATTCGCCG GTGCCCATCG GCACAGTGCC GATCTATCAG GCGCTGGAAA AGGTCGACGG CATCGCCGAG AACCTCACCT GGGAGGTCTA CCGCGACACG CTGATCGAGC AGGCGGAGCA GGGCGTCGAT TATTTCACCA TCCATGCGGG TGTCCGCCTC GCCTATGTGC CGCTCACGGC GAAGCGCGTG ACGGGCATTG TCTCGCGCGG CGGCTCCATC ATGGCGAAGT GGTGCCTCGC GCATCACAAG GAGAGTTTCC TCTACACCCA CTTCGAGGAA ATCTGCGACA TCATGCGCCA ATACGATGTG TCGTTCTCGC TGGGCGACGG TTTGCGTCCC GGCTCCATCG CGGACGCGAA TGACGAGGCG CAATTTGCCG AACTCGAAAC GCTGGGCGAG CTCACGCAGA TCGCGTGGGC CAAGGGCTGC CAGGTGATGA TCGAAGGCCC CGGTCATGTG CCGATGCACA AGATCAAGGT CAACATGGAC AAGCAGCTGA AGCATTGCGG CGGCGCGCCC TTCTATACGC TCGGGCCGCT CACCACCGAC ATCGCGCCGG GCTACGACCA CATCACGTCC GGCATCGGCG CGGCCATGAT CGGCTGGTTC GGCTGCGCCA TGCTCTGCTA CGTCACGCCG AAGGAACATC TCGGCCTGCC GGACAGGGCG GACGTGAAGG AAGGCGTCAT CACCTACAAG ATCGCGGCGC ATGCGGCGGA CCTCGCCAAG GGCCACCCGG CCGCGCAGCT TCGCGACGAC GCGCTTTCGC GCGCGCGGTT CGAGTTCCGC TGGGAGGACC AGTTCAACCT CGCGCTCGAC CCCGAACGCG CGAAAGAGTT CCACGACCGC ACGCTGCCGA AGGAAGCGCA CAAGGTCGCG CATTTCTGCT CCATGTGCGG CCCGAAATTC TGCTCGATGA AAATCACGCA GGAAGTCCGT GACTATGCGG AAAGCGGCAT GGCCGACATG GCGTCCGAAT TCCGCAATTC CGGCGGCGAG 
ATTTATCTCG AAGAAGCGGA CGCGGCGGTG AAGGCATCGA ACAGGGCGCT GGGCGGCAAG GCGGCGGAGT AG
|
Protein sequence | MNVHTPRKDE ALTVTTGPLP ASTKIFTAPE GFPGLKVPFR EIALHPSAKE PPVRVYDTSG PYTDPTAQID LERGLPRTRE AWLEARGGTE LYEGRDVKPE DNGNVGEKHL ARAFPVRNLP RRGLPGHPVT QYEFAKAGIV TAEMAYIAER ENMGRKQAAA NAAHRIAEGE SFGADIPEFI TPEFVRDEVA AGRAIIPSNI NHPELEPMII GRNFLVKINA NIGNSAVASS VAEEVDKMVW AIRWGADNVM DLSTGRNIHN TREWIIRNSP VPIGTVPIYQ ALEKVDGIAE NLTWEVYRDT LIEQAEQGVD YFTIHAGVRL AYVPLTAKRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFEE ICDIMRQYDV SFSLGDGLRP GSIADANDEA QFAELETLGE LTQIAWAKGC QVMIEGPGHV PMHKIKVNMD KQLKHCGGAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP KEHLGLPDRA DVKEGVITYK IAAHAADLAK GHPAAQLRDD ALSRARFEFR WEDQFNLALD PERAKEFHDR TLPKEAHKVA HFCSMCGPKF CSMKITQEVR DYAESGMADM ASEFRNSGGE IYLEEADAAV KASNRALGGK AAE
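The sequence fields are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation such as the GC content reported above. This is a minimal sketch, assuming GC content is the fraction of G and C bases over the full sequence length; the helper names are hypothetical, and only the first ten-base block of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First ten-base block of the gene sequence, as displayed in the record.
first_block = clean_sequence("ATGAACGTCC ")
print(first_block.startswith("ATG"))  # start codon check
print(gc_fraction(first_block))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared against the GC content field.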