Gene Synpcc7942_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1057 
Symbol 
ID3773987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1067527 
End bp1068558 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content60% 
IMG OID637799479 
Productthiamine-phosphate pyrophosphorylase 
Protein accessionYP_400074 
Protein GI81299866 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.004946 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0997928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCTG AGTCGAGCAT GGATTGGGTT GAAACCCGTT GCCACCGCAT TCTGGATGCC 
AACCTCGATC GCGCCCGTGA AGGGTTGCGC ATTCTGGAAG AGTGGTGCCG CTTTGGCCTC
GAGCGCGCTG ACTTGAGCGC GACTTGCAAG GCGCTTCGGC AAGAAGTCGG CAGTTGGCAT
CGCCCAGAAT TTCGTCAAGC TCGCGACACA ACCCACGATC CCGGCACGAG CTTGAGTCAT
CCCCAAGAGC GACAGCGGAC TGACCTCAAT GCCGTTTTAT TGGCGAACTG TGCGCGGGTT
CAGGAAGCCC TGCGGGTGAT TGAGGAATAC GGCAAGCTGA TCGAGGGGGA TCTCAGCGAT
CGCGCTAAGG CAATGCGCTA CCAGATCTAT GTCTTGGAGT CTCAGCTGCA AAGTCGCGAT
CGCCTCAGTC GGTTGCGACA GGCACGGCTT TATCTTGTGA CCTCGCCCCA TCCTCGCTTG
CTGGAGGTCG TTGAAGCGGC TCTTTCGGCA GGGCTGAAAT TGGTGCAGTA CCGCGACAAG
CAGCAGGAGG ATGCAACCCG CCTAGAAACC GCCTGCCGCT TGGCGGAACT CTGCCAGCGC
TACGGAGCCC TTTTCCTCGT CAACGATCGC GTCGATCTAG CACTTGCTTG CGGGGCTGAT
GGCGTTCATC TTGGTCAGCA GGATGTGCCG ATGGACGTGG CTCGCCGCAT TCTCGGCCCC
GATCGCATTG TTGGGCGATC GACGACCAGT CCTGAAGAGT TGGCCCGTGC CAATGCTGAA
GGGGCGGACT ATGTCGGCGT GGGGCCCATC TTTGCCACAC CCACGAAACC CGGCAAAGCA
GCAGCGGGCT TTGACTATTT AGGCTATGCC CGCCAGCAGG CTCAGCAGCC GTTTTATGCG
ATTGGTGGAA TCGATGTCAG CAATGCCGCT GCGGTGGTTG CAGCCGGTGC CGATCGCCTC
GCCGTGGTCC GCGCGATCAT GGAGGCGCCC GACCCGAAGG CCGCTACGGC AGAACTTTTA
CAAATGCTTT AG
 
Protein sequence
MSSESSMDWV ETRCHRILDA NLDRAREGLR ILEEWCRFGL ERADLSATCK ALRQEVGSWH 
RPEFRQARDT THDPGTSLSH PQERQRTDLN AVLLANCARV QEALRVIEEY GKLIEGDLSD
RAKAMRYQIY VLESQLQSRD RLSRLRQARL YLVTSPHPRL LEVVEAALSA GLKLVQYRDK
QQEDATRLET ACRLAELCQR YGALFLVNDR VDLALACGAD GVHLGQQDVP MDVARRILGP
DRIVGRSTTS PEELARANAE GADYVGVGPI FATPTKPGKA AAGFDYLGYA RQQAQQPFYA
IGGIDVSNAA AVVAAGADRL AVVRAIMEAP DPKAATAELL QML