Gene Information |
Locus tag | PMN2A_1188 |
Symbol | |
ID | 3606581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 1677638 |
End bp | 1679038 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637688063 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_292381 |
Protein GI | 72383026 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
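The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1677638
end_bp = 1679038
gene_length_bp = 1401
protein_length_aa = 466


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: the CDS length is a multiple of 3 and contains exactly
    one terminal stop codon, which is not counted as a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the record: the span covers 1401 bp, and 1401 / 3 = 467 codons, one of which is the stop codon, leaving 466 residues.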
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAATT CATGGGTGGC TTCTAGAAAA GGTAAAACCA ATGTTTCTCA GATGCATTTT GCTCGCAAAG GCGAAATTAC TGAAGAAATG AGGTATGTGG CAAAGCGTGA GAATCTTCCT GAGTCTCTGG TTATGGAAGA AGTCGCGCGC GGTCGAATGG TTATTCCTGC AAATATTAAC CATATGAACT TAGAGCCGAT GGCAATAGGT ATTGCCTCAA CATGTAAAGT CAATGCAAAT ATTGGTGCTT CACCAAATGC AAGCGATATT AGTGAAGAAT TAAAGAAGCT TGATCTAGCA GTAAAATATG GGGCTGATAC TCTTATGGAT CTTTCTACTG GAGGGGTTAA TTTAGATGAG GTACGAACTG AAATTATTAA TGCCTCCCCT ATCCCGATAG GGACAGTTCC TGTTTATCAA GCTTTAGAAA GTGTTCACGG TTCTATTTCA AGGTTAAATG AGGATGATTT TTTACACATA ATAGAAAAGC ATTGTCAGCA AGGAGTTGAT TATCAAACCA TTCATGCAGG CTTATTGATT GAACATTTAC CCAAAGTTAA AGGTCGTATT ACTGGAATAG TTAGTCGTGG CGGAGGAATT CTTGCCCAAT GGATGCTTTA TCACTACAAA CAAAATCCTC TATTTACTCG TTTTGATGAT ATTTGTGAAA TTTTTAAACG CTATGACTGC ACCTTTTCTT TAGGTGATTC TCTAAGGCCT GGATGTCTGC ATGATGCATC AGATGAAGCT CAACTCGCTG AATTGAAAAC TCTAGGTGAA TTGACTAGAC GTGCTTGGAA GCATGATGTT CAAGTCATGG TTGAAGGGCC TGGTCATGTA CCTATGGATC AAATCGAATT CAATGTTAGG AAGCAAATGG AGGAGTGTTC AGAAGCTCCC TTTTATGTTC TAGGTCCATT GGTAACAGAC ATTTCTCCTG GTTATGATCA CATTTCAAGT GCTATTGGTG CAGCCATGGC AGGTTGGTAC GGGACTGCGA TGCTTTGTTA TGTAACACCT AAGGAACATC TTGGGTTGCC TAATCCTGAG GATGTTAGAG AAGGTTTAAT TGCTTATAAA ATTGCTGCTC ATGCTGCAGA TGTCGCAAGA CATAGATCAG GAGCTCGTGA TCGTGATGAT GAATTAAGTA AGGCTCGTAA AGAATTTGAC TGGAACAAAC AATTTGAATT GTCCTTAGAT CCAGAAAGAG CCAAGCAATA TCATGACGAA ACTTTACCTG AAGAAATTTT CAAGAAAGCA GAGTTTTGTT CAATGTGCGG TCCTAATCAT TGTCCAATGA ATACAAAAAT CACAGATGAA GATCTTGATA AATTAAACGA TCAAATACAG TCAAAAGGTG CAGCTGAATT AACTCCAGTA AAGTTAAACA AAGAAAACTA G
|
Protein sequence | MRNSWVASRK GKTNVSQMHF ARKGEITEEM RYVAKRENLP ESLVMEEVAR GRMVIPANIN HMNLEPMAIG IASTCKVNAN IGASPNASDI SEELKKLDLA VKYGADTLMD LSTGGVNLDE VRTEIINASP IPIGTVPVYQ ALESVHGSIS RLNEDDFLHI IEKHCQQGVD YQTIHAGLLI EHLPKVKGRI TGIVSRGGGI LAQWMLYHYK QNPLFTRFDD ICEIFKRYDC TFSLGDSLRP GCLHDASDEA QLAELKTLGE LTRRAWKHDV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP FYVLGPLVTD ISPGYDHISS AIGAAMAGWY GTAMLCYVTP KEHLGLPNPE DVREGLIAYK IAAHAADVAR HRSGARDRDD ELSKARKEFD WNKQFELSLD PERAKQYHDE TLPEEIFKKA EFCSMCGPNH CPMNTKITDE DLDKLNDQIQ SKGAAELTPV KLNKEN
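The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be checked against the GC content and protein sequence fields. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAG, even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with table 1; table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The helper names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display spacing (blocks of 10) and normalise case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-orientation sequence up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
gene_prefix = "ATGCGGAATT CATGGGTGGC TTCTAGAAAA"
print(translate(gene_prefix))  # MRNSWVASRK
```

The first 30 bases translate to MRNSWVASRK, which matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.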
|
| |