Gene PMN2A_0840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_0840 
Symbol 
ID3606221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp1346030 
End bp1347082 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content36% 
IMG OID637687706 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_292034 
Protein GI72382679 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.373367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCAA TACCTGTCAC CCCTCCTTCT GATAATCGTA TTGCTCAATT AATTGACGCG 
AACCTTGATC GCGCTAGAGA GGGGCTTAGA GTTATGGAGG ATTGGTGCAG ATTTGGTTTA
AAGAGGAGTG ATTTTTCGAT TCAAATCAAA GATTGGAGGC AACAATTAGG AGTACATCAC
CACAATATTT ATCGAAAAGA AAGGCTTACA TCCATCGATC CAGCTATGGG CATTTCACAT
CCGTTACAAA CAGTCAGATC AACCCCAGAG GATGTATTTA TTGCGAACTC ATCCAGAGTT
CAAGAAGCCC TAAGAGTAAT AGAGGAATTC ACTCGAACAA CAGATCCGCA TCTTTGTGAA
ATAGCTAGCA AAATTAGATA CGAAACTTAT GATATCGAGA TAAAGGTACT TAATTCGACA
GAAGGCGTAA ATAAAAGAGA AATCTTAAAA GATTGTTCCT TATACTTAAT AACCTCAAAC
AGTAGAGATC TAGAAGAGGT TGTTCTTTAT GCTCTAAAAG CTGGTGTAAA AATAGTTCAA
TATAGAGAGA AATTTTTAAA CGATAATGAA AAAATTTCAC AAGCTAAATG TTTAGCCTCT
CTTTGTAAAA AATTCAATTC ACTTTTTATA GTCAATGACC GTATTGATAT CGCACTCGCC
GTTGACGCCG ATGGAATTCA TTTGGGACAA GAAGATATGC CAACAAAAAT CGCGCGACAG
CTACTAGGGG CTGAAAAAAT CATTGGCAGA AGCACGCACT GTCTTGAAGA CATCAAAAAT
GCCGAAGCAG AAGGCTGTGA TTATATTGGT ATAGGACCAA TATTTCCCTC TGAAACAAAA
AAGCAACTAA ATCCAATTGG AATTGACTAC CTAAAAAAAG GATTAAGTGA AACTCTCCTA
CCTGCTTTTG CTATTGGGGG AATGAATAAA TCAAATATCA CAAAATTAAA CCAAATCAAT
AACCTTCGCA TAGCTGTGTG CAATGCAATC ATTAATTCAA ATAATCCCTT TTCAACAACT
GATGAACTTA TCAAACTTCT AAAATGCAAT TAA
 
Protein sequence
MKSIPVTPPS DNRIAQLIDA NLDRAREGLR VMEDWCRFGL KRSDFSIQIK DWRQQLGVHH 
HNIYRKERLT SIDPAMGISH PLQTVRSTPE DVFIANSSRV QEALRVIEEF TRTTDPHLCE
IASKIRYETY DIEIKVLNST EGVNKREILK DCSLYLITSN SRDLEEVVLY ALKAGVKIVQ
YREKFLNDNE KISQAKCLAS LCKKFNSLFI VNDRIDIALA VDADGIHLGQ EDMPTKIARQ
LLGAEKIIGR STHCLEDIKN AEAEGCDYIG IGPIFPSETK KQLNPIGIDY LKKGLSETLL
PAFAIGGMNK SNITKLNQIN NLRIAVCNAI INSNNPFSTT DELIKLLKCN