Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_01971 |
Symbol | thiC |
ID | 4777241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 215312 |
End bp | 216691 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640085696 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001016217 |
Protein GI | 124021910 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCT CTTGGGTGGC TGCCCGTAAG GGTCAGGCCA ATGTCTCGCA ATTGCATTTC GCTCGACAGG GCGTTGTCAC TCAAGAAATG GACTACGTGG CCAGGCGGGA AAACTTGCCT GAATCGCTTG TCATGGAGGA GGTGGCCCGG GGCAGGATGA TTATTCCTGC CAACATCAAT CATGCAAATT TAGAGCCGAT GGCGATTGGT ATCGCCTCCA GCTGCAAGGT GAATGCAAAC ATTGGCGCCT CACCTAACGC CAGTGATGTA GCTGAAGAGC TCAAGAAGCT CGAGCTGGCA GTTAAATATG GCGCAGACAC CGTGATGGAT CTGTCCACTG GAGGGGTCAA TCTTGATGAG GTGCGCACGG CGATCATTAA TGCTTCACCC GTGCCGATCG GCACTGTGCC TGTCTATCAG GCGTTGGAAA GCGTGCATGG CTCGATTGAG AAGCTCGACG AAGATGACTT CCTACACATC ATTGAGAAGC ATTGCCAGCA GGGTGTCGAC TATCAAACCA TTCACGCCGG TTTGTTGATT GAGCACCTTC CGTTGGTGAA GGGACGCCTG ACAGGCATCG TCAGTCGCGG GGGTGGAATT CTTGCTCAGT GGATGCTTTA TCACCACAGA CAGAACCCTC TTTTCACCCG CTTTGACGAC ATCTGCGAGA TCTTCAAGCG CTACGACTGC AGTTTTTCAC TTGGTGATTC TCTTCGTCCT GGTTGTCAGC ACGATGCTTC TGATGCAGCT CAACTTGCCG AGTTGAAGAC CCTTGGAGAA TTGACTAAGA GAGCTTGGGC ACATGACGTG CAGGTGATGG TCGAGGGTCC TGGTCATGTA CCAATGGATC AGATCGAATT CAATGTGCGC AAGCAGATGG AAGAGTGCAA TGAGGCACCC TTTTATGTGC TTGGCCCTTT GGTGACAGAC ATCGCACCGG GTTATGACCA CATCACGAGT GCCATCGGTG CGGCGATGGC AGGCTGGTAT GGAACAGCGA TGCTTTGTTA TGTGACCCCG AAGGAGCATT TGGGTCTGCC AAACCCTGAG GATGTTCGTG AGGGCTTGAT TGCCTACAAA ATTGCAGCGC ATGCCGCTGA CATCGCTCGT CACCGTCCGG GTGCTCGAGA TCGCGATGAT GAATTAAGCC GAGCAAGGTA CAACTTTGAT TGGAACAAAC AGTTTGAGCT TTCACTTGAT CCAGAGCGAG CCAAGCAGTA TCACGATGAA ACTTTGCCAG CTGACATTTA CAAGCAAGCT GAGTTTTGTT CAATGTGTGG TCCAAAGCAT TGTCCAATGC AGACCAAAAT TACGGATGAG GATCTAGAAG GTCTCGAAAA ATCTCTCAAA AGTAAAGGGA AAGCTGAGTT GCCAGCTTAG
|
Protein sequence | MRASWVAARK GQANVSQLHF ARQGVVTQEM DYVARRENLP ESLVMEEVAR GRMIIPANIN HANLEPMAIG IASSCKVNAN IGASPNASDV AEELKKLELA VKYGADTVMD LSTGGVNLDE VRTAIINASP VPIGTVPVYQ ALESVHGSIE KLDEDDFLHI IEKHCQQGVD YQTIHAGLLI EHLPLVKGRL TGIVSRGGGI LAQWMLYHHR QNPLFTRFDD ICEIFKRYDC SFSLGDSLRP GCQHDASDAA QLAELKTLGE LTKRAWAHDV QVMVEGPGHV PMDQIEFNVR KQMEECNEAP FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPNPE DVREGLIAYK IAAHAADIAR HRPGARDRDD ELSRARYNFD WNKQFELSLD PERAKQYHDE TLPADIYKQA EFCSMCGPKH CPMQTKITDE DLEGLEKSLK SKGKAELPA
|
| |