Gene PMN2A_1188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_1188 
Symbol 
ID3606581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1677638 
End bp1679038 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content39% 
IMG OID637688063 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_292381 
Protein GI72383026 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGAATT CATGGGTGGC TTCTAGAAAA GGTAAAACCA ATGTTTCTCA GATGCATTTT 
GCTCGCAAAG GCGAAATTAC TGAAGAAATG AGGTATGTGG CAAAGCGTGA GAATCTTCCT
GAGTCTCTGG TTATGGAAGA AGTCGCGCGC GGTCGAATGG TTATTCCTGC AAATATTAAC
CATATGAACT TAGAGCCGAT GGCAATAGGT ATTGCCTCAA CATGTAAAGT CAATGCAAAT
ATTGGTGCTT CACCAAATGC AAGCGATATT AGTGAAGAAT TAAAGAAGCT TGATCTAGCA
GTAAAATATG GGGCTGATAC TCTTATGGAT CTTTCTACTG GAGGGGTTAA TTTAGATGAG
GTACGAACTG AAATTATTAA TGCCTCCCCT ATCCCGATAG GGACAGTTCC TGTTTATCAA
GCTTTAGAAA GTGTTCACGG TTCTATTTCA AGGTTAAATG AGGATGATTT TTTACACATA
ATAGAAAAGC ATTGTCAGCA AGGAGTTGAT TATCAAACCA TTCATGCAGG CTTATTGATT
GAACATTTAC CCAAAGTTAA AGGTCGTATT ACTGGAATAG TTAGTCGTGG CGGAGGAATT
CTTGCCCAAT GGATGCTTTA TCACTACAAA CAAAATCCTC TATTTACTCG TTTTGATGAT
ATTTGTGAAA TTTTTAAACG CTATGACTGC ACCTTTTCTT TAGGTGATTC TCTAAGGCCT
GGATGTCTGC ATGATGCATC AGATGAAGCT CAACTCGCTG AATTGAAAAC TCTAGGTGAA
TTGACTAGAC GTGCTTGGAA GCATGATGTT CAAGTCATGG TTGAAGGGCC TGGTCATGTA
CCTATGGATC AAATCGAATT CAATGTTAGG AAGCAAATGG AGGAGTGTTC AGAAGCTCCC
TTTTATGTTC TAGGTCCATT GGTAACAGAC ATTTCTCCTG GTTATGATCA CATTTCAAGT
GCTATTGGTG CAGCCATGGC AGGTTGGTAC GGGACTGCGA TGCTTTGTTA TGTAACACCT
AAGGAACATC TTGGGTTGCC TAATCCTGAG GATGTTAGAG AAGGTTTAAT TGCTTATAAA
ATTGCTGCTC ATGCTGCAGA TGTCGCAAGA CATAGATCAG GAGCTCGTGA TCGTGATGAT
GAATTAAGTA AGGCTCGTAA AGAATTTGAC TGGAACAAAC AATTTGAATT GTCCTTAGAT
CCAGAAAGAG CCAAGCAATA TCATGACGAA ACTTTACCTG AAGAAATTTT CAAGAAAGCA
GAGTTTTGTT CAATGTGCGG TCCTAATCAT TGTCCAATGA ATACAAAAAT CACAGATGAA
GATCTTGATA AATTAAACGA TCAAATACAG TCAAAAGGTG CAGCTGAATT AACTCCAGTA
AAGTTAAACA AAGAAAACTA G
 
Protein sequence
MRNSWVASRK GKTNVSQMHF ARKGEITEEM RYVAKRENLP ESLVMEEVAR GRMVIPANIN 
HMNLEPMAIG IASTCKVNAN IGASPNASDI SEELKKLDLA VKYGADTLMD LSTGGVNLDE
VRTEIINASP IPIGTVPVYQ ALESVHGSIS RLNEDDFLHI IEKHCQQGVD YQTIHAGLLI
EHLPKVKGRI TGIVSRGGGI LAQWMLYHYK QNPLFTRFDD ICEIFKRYDC TFSLGDSLRP
GCLHDASDEA QLAELKTLGE LTRRAWKHDV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP
FYVLGPLVTD ISPGYDHISS AIGAAMAGWY GTAMLCYVTP KEHLGLPNPE DVREGLIAYK
IAAHAADVAR HRSGARDRDD ELSKARKEFD WNKQFELSLD PERAKQYHDE TLPEEIFKKA
EFCSMCGPNH CPMNTKITDE DLDKLNDQIQ SKGAAELTPV KLNKEN