Gene P9515_17991 details


Gene Information

Locus tag: P9515_17991
Symbol: thiC
ID: 4719968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9515
Kingdom: Bacteria
Replicon accession: NC_008817
Strand:
Start bp: 1585928
End bp: 1587298
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 37%
IMG OID: 640081498
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001012113
Protein GI: 123967032
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
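
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record itself. The function name `check_gene_record` is hypothetical and used only for illustration.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return a list of inconsistencies found in the record (empty if none).

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon.
    """
    problems = []

    # With inclusive coordinates, the span is end - start + 1.
    span = end_bp - start_bp + 1
    if span != gene_length_bp:
        problems.append(f"span {span} bp != gene length {gene_length_bp} bp")

    # A complete CDS is a whole number of codons; the last one is the stop.
    if gene_length_bp % 3 != 0:
        problems.append(f"gene length {gene_length_bp} bp is not a multiple of 3")
    elif gene_length_bp // 3 - 1 != protein_length_aa:
        problems.append(
            f"{gene_length_bp // 3 - 1} coding codons != "
            f"protein length {protein_length_aa} aa"
        )
    return problems


# Values taken from the record above.
problems = check_gene_record(1585928, 1587298, 1371, 456)
print(problems or "record is internally consistent")
```

Under those assumptions the fields agree: 1587298 - 1585928 + 1 = 1371 bp, and 1371 / 3 = 457 codons, one of which is the stop codon, leaving 456 amino acids.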


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAATT CTTGGATAAA GCCTCGCCTT GGACAAAAAA ATATTACTCA GATGAATTTT 
GCTAAAAATG GAATTATAAC TGAAGAAATG AACTATGTTG CTCAAAAAGA GAACCTTCCA
TCTTCATTAA TTATGGAAGA AGTTGCAAGG GGCAGATTAA TAATTCCAGC TAATATAAAT
CATGTAAATC TTGAACCAAT GGCGATTGGT ATTGCTTCTA AATGCAAAGT AAATGCAAAT
ATTGGTGCTT CCCCCAATGC AAGTGATATA AATGAAGAAG TTGAGAAGCT TAGGTTAGCG
GTCAAATATG GTGCTGATAC AGTTATGGAT TTATCTACGG GTGGAGTAAA TCTTGATGAG
GTGAGACAAG CAATTATCAA AGAATCTTCT GTCCCTATTG GTACTGTTCC AGTTTATCAA
GCTTTAGAAA GTGTTCATGG ATCTATAGAC AGATTAACAG AAGACGATTT CTTACATATT
ATTGAAAAAC ATTGTCAGCA AGGAGTTGAT TATCAAACGA TTCATGCTGG TTTATTAATA
GAACATTTAC CTAAAGTAAA AGGAAGAATT ACTGGCATCG TTAGTCGAGG TGGAGGAATT
CTTGCTCAAT GGATGTTGCA TCATTTTAAG CAAAACCCCT TGTATACAAG ATTTGATGAT
ATTTGTGAAA TTTTCAAAAA ATATGATTGT ACTTTCTCTT TAGGAGATTC ACTTAGACCG
GGATGTTTAC ATGATGCATC AGATGATGCT CAATTAGCTG AATTGAAAAC ATTGGGTGAG
CTTACAAGAA GAGCTTGGGC TCATAATGTT CAGGTTATGG TGGAAGGTCC AGGGCATGTC
CCTATGGATC AAATTGAGTT TAATGTTCGA AAACAGATGG AAGAATGTTC AGAAGCTCCT
TTTTATGTCC TAGGACCATT AGTAACTGAT ATCTCTCCTG GCTATGATCA TATTTCAAGT
GCTATCGGCG CTGCAATGGC AGGATGGTAC GGAACTGCGA TGTTATGTTA TGTCACTCCT
AAAGAGCATT TGGGTCTCCC AAATGCTGAA GATGTAAGAG AGGGGTTAAT AGCCTATAAA
ATCGCTGCAC ATGCAGCAGA TATCGCTAGG CATAGAGCGG GGGCTCGTGA TAGAGATGAT
GAGCTAAGTC ACGCAAGATA TACTTTTGAC TGGAATAAAC AGTTTGAACT TTCTTTAGAT
CCTGAAAGGG CTAAACAATA TCATGATGAA ACTTTACCAG AAGAAATATT TAAAAAAGCT
GAGTTCTGTT CAATGTGTGG TCCTAAGCAT TGCCCCATGA ATTCAAAAAT TTCTGATGAA
ACACTTGATC AATTGAATAA TAAACTCGCA AAATGTGACA TTAAAGTTTA G
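
The GC content field can be recomputed from a sequence laid out in this 10-base block format. The sketch below is illustrative only: the helper name `gc_content` is hypothetical, and it assumes the sequence contains only A, C, G and T separated by whitespace.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks of the 10-base block layout)
    is ignored. Assumes an unambiguous A/C/G/T alphabet.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence above: 5 of 20 bases are G or C.
print(gc_content("ATGAGAAATT CTTGGATAAA"))  # 25.0
```

Passing the full gene sequence as one multi-line string gives the value to compare with the rounded figure in the GC content field.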
 
Protein sequence
MRNSWIKPRL GQKNITQMNF AKNGIITEEM NYVAQKENLP SSLIMEEVAR GRLIIPANIN 
HVNLEPMAIG IASKCKVNAN IGASPNASDI NEEVEKLRLA VKYGADTVMD LSTGGVNLDE
VRQAIIKESS VPIGTVPVYQ ALESVHGSID RLTEDDFLHI IEKHCQQGVD YQTIHAGLLI
EHLPKVKGRI TGIVSRGGGI LAQWMLHHFK QNPLYTRFDD ICEIFKKYDC TFSLGDSLRP
GCLHDASDDA QLAELKTLGE LTRRAWAHNV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP
FYVLGPLVTD ISPGYDHISS AIGAAMAGWY GTAMLCYVTP KEHLGLPNAE DVREGLIAYK
IAAHAADIAR HRAGARDRDD ELSHARYTFD WNKQFELSLD PERAKQYHDE TLPEEIFKKA
EFCSMCGPKH CPMNSKISDE TLDQLNNKLA KCDIKV
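
The protein sequence corresponds to a translation of the gene sequence under the translation table listed in the record (11). As a rough sketch, the code below builds the codon table from the standard compact genetic-code string; table 11 shares its codon-to-amino-acid assignments with the standard code and differs in the set of permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to be an in-frame A/C/G/T coding sequence beginning at the start codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons (e.g. GTG, TTG) are
    not converted to methionine; this gene starts with ATG, so that
    simplification does not matter here.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGAAATT CTTGGATAAA GCCTCGCCTT"))  # MRNSWIKPRL
```

The ten residues printed match the first block of the protein sequence above. A codon containing any character other than A, C, G or T raises a `KeyError` in this sketch.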