Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_17991 |
Symbol | thiC |
ID | 4719968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 1585928 |
End bp | 1587298 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640081498 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001012113 |
Protein GI | 123967032 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAATT CTTGGATAAA GCCTCGCCTT GGACAAAAAA ATATTACTCA GATGAATTTT GCTAAAAATG GAATTATAAC TGAAGAAATG AACTATGTTG CTCAAAAAGA GAACCTTCCA TCTTCATTAA TTATGGAAGA AGTTGCAAGG GGCAGATTAA TAATTCCAGC TAATATAAAT CATGTAAATC TTGAACCAAT GGCGATTGGT ATTGCTTCTA AATGCAAAGT AAATGCAAAT ATTGGTGCTT CCCCCAATGC AAGTGATATA AATGAAGAAG TTGAGAAGCT TAGGTTAGCG GTCAAATATG GTGCTGATAC AGTTATGGAT TTATCTACGG GTGGAGTAAA TCTTGATGAG GTGAGACAAG CAATTATCAA AGAATCTTCT GTCCCTATTG GTACTGTTCC AGTTTATCAA GCTTTAGAAA GTGTTCATGG ATCTATAGAC AGATTAACAG AAGACGATTT CTTACATATT ATTGAAAAAC ATTGTCAGCA AGGAGTTGAT TATCAAACGA TTCATGCTGG TTTATTAATA GAACATTTAC CTAAAGTAAA AGGAAGAATT ACTGGCATCG TTAGTCGAGG TGGAGGAATT CTTGCTCAAT GGATGTTGCA TCATTTTAAG CAAAACCCCT TGTATACAAG ATTTGATGAT ATTTGTGAAA TTTTCAAAAA ATATGATTGT ACTTTCTCTT TAGGAGATTC ACTTAGACCG GGATGTTTAC ATGATGCATC AGATGATGCT CAATTAGCTG AATTGAAAAC ATTGGGTGAG CTTACAAGAA GAGCTTGGGC TCATAATGTT CAGGTTATGG TGGAAGGTCC AGGGCATGTC CCTATGGATC AAATTGAGTT TAATGTTCGA AAACAGATGG AAGAATGTTC AGAAGCTCCT TTTTATGTCC TAGGACCATT AGTAACTGAT ATCTCTCCTG GCTATGATCA TATTTCAAGT GCTATCGGCG CTGCAATGGC AGGATGGTAC GGAACTGCGA TGTTATGTTA TGTCACTCCT AAAGAGCATT TGGGTCTCCC AAATGCTGAA GATGTAAGAG AGGGGTTAAT AGCCTATAAA ATCGCTGCAC ATGCAGCAGA TATCGCTAGG CATAGAGCGG GGGCTCGTGA TAGAGATGAT GAGCTAAGTC ACGCAAGATA TACTTTTGAC TGGAATAAAC AGTTTGAACT TTCTTTAGAT CCTGAAAGGG CTAAACAATA TCATGATGAA ACTTTACCAG AAGAAATATT TAAAAAAGCT GAGTTCTGTT CAATGTGTGG TCCTAAGCAT TGCCCCATGA ATTCAAAAAT TTCTGATGAA ACACTTGATC AATTGAATAA TAAACTCGCA AAATGTGACA TTAAAGTTTA G
|
Protein sequence | MRNSWIKPRL GQKNITQMNF AKNGIITEEM NYVAQKENLP SSLIMEEVAR GRLIIPANIN HVNLEPMAIG IASKCKVNAN IGASPNASDI NEEVEKLRLA VKYGADTVMD LSTGGVNLDE VRQAIIKESS VPIGTVPVYQ ALESVHGSID RLTEDDFLHI IEKHCQQGVD YQTIHAGLLI EHLPKVKGRI TGIVSRGGGI LAQWMLHHFK QNPLYTRFDD ICEIFKKYDC TFSLGDSLRP GCLHDASDDA QLAELKTLGE LTRRAWAHNV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP FYVLGPLVTD ISPGYDHISS AIGAAMAGWY GTAMLCYVTP KEHLGLPNAE DVREGLIAYK IAAHAADIAR HRAGARDRDD ELSHARYTFD WNKQFELSLD PERAKQYHDE TLPEEIFKKA EFCSMCGPKH CPMNSKISDE TLDQLNNKLA KCDIKV
|
| |