Gene P9303_01971 details


Gene Information

Locus tag: P9303_01971
Symbol: thiC
ID: 4777241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 215312
End bp: 216691
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 52%
IMG OID: 640085696
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001016217
Protein GI: 124021910
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
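
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are hypothetical helpers introduced only for illustration.

```python
# Minimal sketch: check that the record's coordinates and lengths agree.
# Assumption (not stated in the record): coordinates are 1-based, inclusive.
# Assumption (not stated in the record): gene length includes the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    length = gene_length(215312, 216691)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values from this record, the two helpers give 1380 bp and 459 aa, matching the Gene Length and Protein Length fields.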


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCCT CTTGGGTGGC TGCCCGTAAG GGTCAGGCCA ATGTCTCGCA ATTGCATTTC 
GCTCGACAGG GCGTTGTCAC TCAAGAAATG GACTACGTGG CCAGGCGGGA AAACTTGCCT
GAATCGCTTG TCATGGAGGA GGTGGCCCGG GGCAGGATGA TTATTCCTGC CAACATCAAT
CATGCAAATT TAGAGCCGAT GGCGATTGGT ATCGCCTCCA GCTGCAAGGT GAATGCAAAC
ATTGGCGCCT CACCTAACGC CAGTGATGTA GCTGAAGAGC TCAAGAAGCT CGAGCTGGCA
GTTAAATATG GCGCAGACAC CGTGATGGAT CTGTCCACTG GAGGGGTCAA TCTTGATGAG
GTGCGCACGG CGATCATTAA TGCTTCACCC GTGCCGATCG GCACTGTGCC TGTCTATCAG
GCGTTGGAAA GCGTGCATGG CTCGATTGAG AAGCTCGACG AAGATGACTT CCTACACATC
ATTGAGAAGC ATTGCCAGCA GGGTGTCGAC TATCAAACCA TTCACGCCGG TTTGTTGATT
GAGCACCTTC CGTTGGTGAA GGGACGCCTG ACAGGCATCG TCAGTCGCGG GGGTGGAATT
CTTGCTCAGT GGATGCTTTA TCACCACAGA CAGAACCCTC TTTTCACCCG CTTTGACGAC
ATCTGCGAGA TCTTCAAGCG CTACGACTGC AGTTTTTCAC TTGGTGATTC TCTTCGTCCT
GGTTGTCAGC ACGATGCTTC TGATGCAGCT CAACTTGCCG AGTTGAAGAC CCTTGGAGAA
TTGACTAAGA GAGCTTGGGC ACATGACGTG CAGGTGATGG TCGAGGGTCC TGGTCATGTA
CCAATGGATC AGATCGAATT CAATGTGCGC AAGCAGATGG AAGAGTGCAA TGAGGCACCC
TTTTATGTGC TTGGCCCTTT GGTGACAGAC ATCGCACCGG GTTATGACCA CATCACGAGT
GCCATCGGTG CGGCGATGGC AGGCTGGTAT GGAACAGCGA TGCTTTGTTA TGTGACCCCG
AAGGAGCATT TGGGTCTGCC AAACCCTGAG GATGTTCGTG AGGGCTTGAT TGCCTACAAA
ATTGCAGCGC ATGCCGCTGA CATCGCTCGT CACCGTCCGG GTGCTCGAGA TCGCGATGAT
GAATTAAGCC GAGCAAGGTA CAACTTTGAT TGGAACAAAC AGTTTGAGCT TTCACTTGAT
CCAGAGCGAG CCAAGCAGTA TCACGATGAA ACTTTGCCAG CTGACATTTA CAAGCAAGCT
GAGTTTTGTT CAATGTGTGG TCCAAAGCAT TGTCCAATGC AGACCAAAAT TACGGATGAG
GATCTAGAAG GTCTCGAAAA ATCTCTCAAA AGTAAAGGGA AAGCTGAGTT GCCAGCTTAG
 
Protein sequence
MRASWVAARK GQANVSQLHF ARQGVVTQEM DYVARRENLP ESLVMEEVAR GRMIIPANIN 
HANLEPMAIG IASSCKVNAN IGASPNASDV AEELKKLELA VKYGADTVMD LSTGGVNLDE
VRTAIINASP VPIGTVPVYQ ALESVHGSIE KLDEDDFLHI IEKHCQQGVD YQTIHAGLLI
EHLPLVKGRL TGIVSRGGGI LAQWMLYHHR QNPLFTRFDD ICEIFKRYDC SFSLGDSLRP
GCQHDASDAA QLAELKTLGE LTKRAWAHDV QVMVEGPGHV PMDQIEFNVR KQMEECNEAP
FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPNPE DVREGLIAYK
IAAHAADIAR HRPGARDRDD ELSRARYNFD WNKQFELSLD PERAKQYHDE TLPADIYKQA
EFCSMCGPKH CPMQTKITDE DLEGLEKSLK SKGKAELPA
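
The protein sequence can be spot-checked against the gene sequence by translating codons directly. The sketch below is a hand-rolled translator, not the database's own method. It builds the codon table used by translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical and introduced only for illustration.

```python
# Minimal sketch: translate a DNA coding sequence codon by codon.
# Codon assignments follow the standard code, which translation table 11 shares.
# Alternative start codons are NOT handled here (a simplifying assumption).
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codons enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCGCGCCT CTTGGGTGGC TGCCCGTAAG"))
    # Last 18 bases of the gene sequence, ending in the TAG stop codon.
    print(translate("GCTGAGTT GCCAGCTTAG"))
```

The first fragment translates to MRASWVAARK and the last to AELPA, which agree with the start and end of the listed protein sequence.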