Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_18211 |
Symbol | thiC |
ID | 4718558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1554194 |
End bp | 1555564 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640079554 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001010211 |
Protein GI | 123969353 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.443876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGTT CTTGGATTAA GCCTCGCCTA GGGAAAGACA ATGTAACTCA GATGAACTTT GCGAGAAATG GATATATCAC TGAAGAAATG GATTTTGTTG CTAAAAAAGA GAATCTTCCT TCTTCTTTAA TAATGGAAGA AGTAGCAAGA GGAAGATTAA TTATTCCAGC TAATATTAAT CATTTGAATC TTGAGCCAAT GTCTATAGGT ATTGCTTCAA GATGCAAAGT TAATGCCAAT ATTGGTGCTT CCCCTAACGC AAGTGATATC AATGAAGAAG TAGAAAAGCT CAAACTTGCT GTTAAATATG GTGCTGATAC GGTTATGGAT CTTTCTACGG GAGGAGTAAA TTTAGATGAA GTCCGACAAG CAATTATTCA AGAATCTCCT GTTCCCATCG GAACTGTTCC TGTTTATCAA GCTTTAGAAA GTGTTCATGG TTCAATAGAT AGACTAACAG AAGATGATTT TCTTCATATT ATTGAAAAAC ATTGTCAGCA GGGCGTAGAT TATCAAACTA TTCATGCTGG TCTATTAATA GAGCATTTGC CAAAAGTTAA AGGAAGAATT ACTGGAATTG TCAGTAGAGG GGGAGGTATT TTAGCCCAAT GGATGTTACA TCATTTTAAA CAAAATCCTC TTTACACAAG GTTTGATGAT ATCTGTGAGA TTTTCAAGAA ATATGATTGT ACTTTCTCTC TCGGAGATTC GCTTAGGCCT GGATGTTTGC ATGATGCTTC TGATGATGCT CAGCTAGCTG AATTGAAGAC CTTAGGCGAG CTTACTCGAA GAGCATGGGA ACATAATGTT CAAGTAATGG TTGAAGGTCC TGGTCATGTC CCTATGGATC AAATTGAGTT TAATGTGAGA AAGCAAATGG AAGAATGTTC AGAAGCCCCT TTCTATGTAC TTGGTCCATT AGTAACAGAT ATTTCTCCTG GTTATGACCA TATATCAAGT GCTATTGGGG CGGCAATGGC GGGGTGGTAT GGAACTTCGA TGCTATGTTA TGTAACCCCA AAAGAACATC TAGGCCTTCC AAATGCAGAA GATGTAAGAG AAGGATTAAT TGCTTATAAA ATAGCTGCTC ACGCTGCTGA TATAGCAAGA CATAGAGCTG GAGCTCGTGA TCGAGATGAT GAACTTAGTC ATGCAAGGTA TAACTTTGAT TGGAATAAAC AATTCGAACT TTCTTTAGAT CCGGAAAGGG CAAAGCAGTA CCATGATGAA ACATTGCCTG AAGAGATCTT TAAAAAGGCT GAGTTTTGTT CAATGTGTGG CCCAAAACAT TGTCCAATGA ATTCAAAGAT TTCAGATGAA TCACTAGATC AACTAAAAGA TAAACTTGAA GAATGTAATA CTTCAGTTTA G
|
Protein sequence | MRSSWIKPRL GKDNVTQMNF ARNGYITEEM DFVAKKENLP SSLIMEEVAR GRLIIPANIN HLNLEPMSIG IASRCKVNAN IGASPNASDI NEEVEKLKLA VKYGADTVMD LSTGGVNLDE VRQAIIQESP VPIGTVPVYQ ALESVHGSID RLTEDDFLHI IEKHCQQGVD YQTIHAGLLI EHLPKVKGRI TGIVSRGGGI LAQWMLHHFK QNPLYTRFDD ICEIFKKYDC TFSLGDSLRP GCLHDASDDA QLAELKTLGE LTRRAWEHNV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP FYVLGPLVTD ISPGYDHISS AIGAAMAGWY GTSMLCYVTP KEHLGLPNAE DVREGLIAYK IAAHAADIAR HRAGARDRDD ELSHARYNFD WNKQFELSLD PERAKQYHDE TLPEEIFKKA EFCSMCGPKH CPMNSKISDE SLDQLKDKLE ECNTSV
|
| |