Gene Information |
Locus tag | Rpal_4096 |
Symbol | |
ID | 6411780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4394348 |
End bp | 4396273 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642713978 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001993067 |
Protein GI | 192292462 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
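
The coordinate and length fields in this record are related by simple arithmetic, which gives a quick sanity check. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates (the listed values are consistent with this) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4394348, 4396273)
print(length, protein_length(length))  # prints "1926 641"
```

Both results agree with the Gene Length (1926 bp) and Protein Length (641 aa) fields.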
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000111141 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATATCC GCTCCAATCC CGAGACCACC CGCGCGGCTG TGACCACCGG TGCCCTGCCC TCCTCCAAGA AGATCTACGC CACGCCTGCC AGCGCGCCGG ATCTGCGCGT GCCGCTGCGC GAGATCATCC TGAGCGAAGG CGCAGGCGAG CCGAACCTGC CGGTGTACGA CACCTCCGGC CCCTACACCG ATCCGACCGT CGTGATCGAC GTCAACAAGG GCCTGCCGCG TCCGCGCACC GAATGGGTCA AGCAGCGCGG CGGCGTCGAG CAATATGAGG GCCGCGACAT CAAGCCGGAA GACAACGGCA ATGTCGGCGC TGCGCACGCG GCCAAAGCCT TCACCGCGCA TCACCAGCCG CTGCGCGGGA TCAGTGATGC GCCGATCACC CAATACGAAT TCGCCCGCCG CGGCATCATC ACCAAGGAGA TGATCTACGT CGCCGAGCGC GAGAATTTGG GCCGCAAGCA GCAGCTCGAG CGTGCCGAGG CGGCGCTGGC CGATGGTGAA TCGTTCGGCG CCGCGGTGCC GGCGTTCATC ACGCCGGAAT TCGTCCGCGA CGAGATCGCC CGCGGCCGCG CCATCATTCC GGCCAACATC AATCACGGCG AACTCGAGCC GATGATCATC GGCCGCAACT TCCTCACCAA GATCAACGCC AATATCGGCA ACTCGGCGGT GACCTCGTCG GTCGAGGAGG AAGTCGACAA GATGGTGTGG GCGATCCGCT GGGGTGCCGA CACCGTGATG GACCTCTCGA CCGGCCGCAA CATTCATACC ACCCGCGAAT GGATCTTGCG CAACTCGCCG GTGCCGATCG GCACCGTGCC GATCTATCAG GCGCTGGAGA AGTGCGAAGG CGATCCGGTC AAGCTGACCT GGGAGCTGTA CAAGGACACG CTGATCGAGC AGGCCGAACA GGGCGTCGAT TACTTCACCA TCCACGCCGG CGTGCGGCTG CAGTACATCC ACCTCACCGC CAGCCGGGTC ACCGGCATCG TGTCGCGCGG CGGCTCGATC ATGGCGAAGT GGTGCCTGGC GCATCACAAG GAGAGCTTCC TCTACACGCA TTTCGACGAG ATCTGCGACC TGATGCGGAA GTACGACGTG TCGTTCTCGC TCGGCGACGG CCTGCGGCCG GGCTCGATCG CGGACGCCAA CGACCGCGCC CAGTTCGCCG AACTGGAGAC GCTCGGCGAG CTCACCAAGA TCGCCTGGGC CAAGGGCTGC CAGGTGATGA TCGAAGGCCC CGGCCACGTG CCGATGCACA AGATCAAGAT CAACATGGAC AAGCAGCTCA AGGAGTGCGG CGAGGCGCCG TTCTACACCT TGGGCCCGCT GACCACCGAT ATCGCACCGG GCTATGATCA CATCACTTCC GGCATCGGCG CCGCGATGAT CGGCTGGTTC GGCTGCGCGA TGCTGTGCTA CGTCACGCCG AAGGAGCATC TCGGCCTGCC CGACCGCAAT GACGTCAAGA CCGGCGTGAT CACCTACAAG ATCGCCGCCC ACGCCGCCGA CCTCGCCAAG GGCCACCCCG CCGCCCAGCT CCGCGACGAC GCACTCTCCC GTGCAAGGTT CGAATTCCGC TGGCAGGACC AGTTCAATCT CGGCCTCGAT CCGGACACGG CGCAGGCCTT CCACGACGAG ACCCTACCGA AGGACGCCCA CAAGGTCGCG CATTTCTGCT CGATGTGCGG CCCGAAATTC TGCTCGATGA AGATCACGCA GGACGTCCGC GACTACGCCG CCGGCCTCGG CGACAACGAG AAAGCCGCCC TCTACCCGGT CGGCCACGCC 
GGCATGACCA TCTCCGGCAC CATCGAAGAC GGCATGGCCC AGATGAGCGC CAAGTTCAAA GAGATGGGAA GCAGCGTGTA TCTCGATGCC GACAAGGTGA AAGAGAGCAA CAAGGCGCTG TCGTAA
Protein sequence | MNIRSNPETT RAAVTTGALP SSKKIYATPA SAPDLRVPLR EIILSEGAGE PNLPVYDTSG PYTDPTVVID VNKGLPRPRT EWVKQRGGVE QYEGRDIKPE DNGNVGAAHA AKAFTAHHQP LRGISDAPIT QYEFARRGII TKEMIYVAER ENLGRKQQLE RAEAALADGE SFGAAVPAFI TPEFVRDEIA RGRAIIPANI NHGELEPMII GRNFLTKINA NIGNSAVTSS VEEEVDKMVW AIRWGADTVM DLSTGRNIHT TREWILRNSP VPIGTVPIYQ ALEKCEGDPV KLTWELYKDT LIEQAEQGVD YFTIHAGVRL QYIHLTASRV TGIVSRGGSI MAKWCLAHHK ESFLYTHFDE ICDLMRKYDV SFSLGDGLRP GSIADANDRA QFAELETLGE LTKIAWAKGC QVMIEGPGHV PMHKIKINMD KQLKECGEAP FYTLGPLTTD IAPGYDHITS GIGAAMIGWF GCAMLCYVTP KEHLGLPDRN DVKTGVITYK IAAHAADLAK GHPAAQLRDD ALSRARFEFR WQDQFNLGLD PDTAQAFHDE TLPKDAHKVA HFCSMCGPKF CSMKITQDVR DYAAGLGDNE KAALYPVGHA GMTISGTIED GMAQMSAKFK EMGSSVYLDA DKVKESNKAL S
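
The sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, here is a minimal standard-library Python sketch. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments are
# those of the standard code, assumed here to apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAATATCC GCTCCAATCC CGAGACCACC"))  # prints "MNIRSNPETT"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.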