Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_1965 |
Symbol | |
ID | 3774150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 2035730 |
End bp | 2037379 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637800408 |
Product | exopolyphosphatase |
Protein accession | YP_400982 |
Protein GI | 81300774 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.363541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG AGATTGCGCC CCACGCCCGT CCAGCACTCC CCAGTGAGCT CGGGCCAGAA CCGATTTTGG CAGCAATCGA TGTGGGCACA AACTCCATTC ACATGGTGGT GGTCCAGATT CGTCCCGACC TGCCGGCCTT CGATATCGTC GGCACCGAGA AAGCCACCGT CCGACTGGGC GATCGCGATC CTGCCACGGG TGAGTTGACC GAGGCGGCGA TGGAACGGGC ACTCCAAGCG CTGCAACGCT GTCGAGCGAT CGCGGCGGCT CGGGGGGCCA CGCAAATTGT GGCTGTCGCC ACCAGTGCTG TGCGAGAGGC TCCCAATGGT CGCCAATTTC TCGAGCGGGT CATCCATGAA GTCGGTTTAG AAGTCGATCT GATCTCGGGG CCAGAGGAAG CACGACGCAT CTACCTTGGG GTGCTGTCCG GCATGGATTT TCAGCAGCGA CACCACGCCA TCATTGATAT TGGCGGTGGA TCGACCGAGC TGATTCTGGG AGCGGGGCAA GAGCCGCTCT GCCTGACCAG CACGAAGGTT GGCGCCGTCC GTCTCACCCA AGAATTCATT CACACCGACC CAATTAGCCC CAGCGAGTAC ACCGTGCTGC AAGCCTACGT GCGCGGCATG GTCGAACGGG CGGTCGATGA AGTCAAAGCC GCTCTACCAC CCAACACGCC CCTACGACTG ATCGGCACTT CCGGCACCAT TCAAGCCCTC GCCGCCCTCC ATGCCCACCA AAGCCAAGGC AGCTTGCCGA CCACCTTCAA TGGCTACAGC TTAGCTCTTG GCGATCTGCA GCAGTTGGTG CAGCAGTTGC GGCGTATGCC CTTTGCCGAA CGTCAGACTT TGCCCGAGCT GTCGGAACGA CGAGCTGAGA TTATCGTTGC AGGGGCGATC GTGCTGCAGG AAACCATGCA GCTGTTGGGT TGCGATCGCG TCACCATCTG CGAACGGGCC TTGCGCGAGG GGCTGATCGT CGACTGGATG CTCAGCCACG GACTGATCGA AGACAAGCTG CGCTATCAGG GATCGGTGCG GCAGCGCAGT GTCTACAACC AAGCGCGCAA ATTTCGGGTA GATGTCAGTC ACGGCGAGCA AGTGGCGCAA TTAGCCCTCA GCCTGTTTGA CCAACTGCGC GGACAACTAC ACCAATGGGG CGAGAGCGAA CGGGAACTCC TTTGGGCAGC CGCCATCCTG CACAATTGCG GCCATCACAT CGACCACTCC TCCCATCACA AGCATTCCTA TTACCTGATC CGCCACGGGG GTCTGCTGGG CTACAACGAG ACGGAGATTG AACTGATTGC TAATCTGGCT CGCTATCACC GCAAGAGCCT GCCCAAGAAA AAGCACGAAA ACTTCCGCAC CCTGCCCACC AAGGAGCAAC GGCGCTTGGT AGAACAACTC AGCGCGATTC TGCGGGCGGC AGTTGCCCTC GATCGCCGGC AGGTAGGAGC GATCGCGAGT CTTCATTGCC GCTACCTTGC TCCGCAACGG CAGCTTCTTC TTCAACTGCA TCCGGCTCGA ACCAGTGAGG ACTGCGCCCT CGAACTCTGG AGTTTTGACT ACAACCGCCA TGCTTTAGAA GCCGCGTTTG CAATCAATGT CGCCGCAGAA CTGGTGCCGC AGCCGCTCAG CGCAGTCTAG
|
Protein sequence | MSTEIAPHAR PALPSELGPE PILAAIDVGT NSIHMVVVQI RPDLPAFDIV GTEKATVRLG DRDPATGELT EAAMERALQA LQRCRAIAAA RGATQIVAVA TSAVREAPNG RQFLERVIHE VGLEVDLISG PEEARRIYLG VLSGMDFQQR HHAIIDIGGG STELILGAGQ EPLCLTSTKV GAVRLTQEFI HTDPISPSEY TVLQAYVRGM VERAVDEVKA ALPPNTPLRL IGTSGTIQAL AALHAHQSQG SLPTTFNGYS LALGDLQQLV QQLRRMPFAE RQTLPELSER RAEIIVAGAI VLQETMQLLG CDRVTICERA LREGLIVDWM LSHGLIEDKL RYQGSVRQRS VYNQARKFRV DVSHGEQVAQ LALSLFDQLR GQLHQWGESE RELLWAAAIL HNCGHHIDHS SHHKHSYYLI RHGGLLGYNE TEIELIANLA RYHRKSLPKK KHENFRTLPT KEQRRLVEQL SAILRAAVAL DRRQVGAIAS LHCRYLAPQR QLLLQLHPAR TSEDCALELW SFDYNRHALE AAFAINVAAE LVPQPLSAV
|
| |