Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_2181 |
Symbol | |
ID | 3773738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 2259980 |
End bp | 2261191 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637800626 |
Product | hypothetical protein |
Protein accession | YP_401198 |
Protein GI | 81300990 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.392153 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGTG GCTGGCGGAT TGGCTCCATT TTGGGCATCC CGCTCAGGAT CGATCCTTCT TGGTTTGTGA TCGTGGCACT CGTCACGTTC AGCTATTCCG AAACCTTCCG ATCGCAGCAG CCGACTTGGT CGCCGGGGCT GTTGTGGGGG GCTGCCCTCG TCATGGCCTT GCTGCTGTTT GCCTCTGTCT TGGCCCATGA GTTGGGGCAC AGTCTGATTG CCCGCGCCCA AGGGATTCGC GTCAGCTCGA TCACGCTCTT CCTCTTTGGT GGTGTCGCGG CCATTGAGCG CGAGTCGCGG ACCCCGGGGG GCGCTTTTTG GGTCGCGATC GCGGGGCCGT TGGTCAGCTT TGCCTTGGCG TTGTTACTGC TGATCAGTCA GCTGTGGTGG CCAGCAGGTT CACCAGCGCA AGTTTTATCT CTCAATCTGG GGCGACTGAA CTTTATCTTG GCGGTGTTTA ATCTCATCCC GGGGTTGCCC TTGGATGGTG GTCAGGTGCT CAAGGCGATC GCCTGGAAAG TGACGGGCGA TCGCTATCGG GCGGTGCATT GGGCTGCGAA CTCAGGTCGG ATTCTCAGTG CGATTGCCAT GGCGATCGGG CTATTTAGCT GGTTTTTGGG GCCCGGCGGT TTTAGCGGCG TGTGGCTGGC GCTGTTGGGC TGGTTTGGCT GGCGTAATGC CACGGCCTAC GATCGCACCA CCACCTTGCA ACAGGCGATC CTGGCGATCG GCGCCAGCGA AGCGATGAGT CGTCGCTATC GGGTGCTGGA AGGATCACTG ACCTTACGGC AGTTTGCGGA GCTGCTGATC ACTGAAGAGC AGGAAGGATT TGCCTACTTT GTGGCTAGTG ACGGGCGCTA TCGGGGTCGG ATTAGCTTAG CGACCCTGCG GCAAACTGAG CGATCGCAGT GGGATCGGCT GACCTTAACG GATTTAGCAG AACCGTTCGA CCGCTTGCCT GCGCTGCCCG AGACGGCCAA TCTAGCCCAA GCGATCGCTG CTTTGCAAAC AGCCCAGCCC AGCTACGTCA CAGTTCTGAC TCCCAGTGGC GCGGTCGCCG GCATCATTGA CCATGCCGAT GTGATTCAAG CCCTCGGCAA AAAACTGGGC TTTCAGCTCC CCCCTGCCGA ACTCCAGCAG ATTCGGGGCC GTGCCGCCTA TCCCGATGGA CTGCCGCTAG AAATGCTGGC GCAGTCAGTT CTAAACAACT GA
|
Protein sequence | MQSGWRIGSI LGIPLRIDPS WFVIVALVTF SYSETFRSQQ PTWSPGLLWG AALVMALLLF ASVLAHELGH SLIARAQGIR VSSITLFLFG GVAAIERESR TPGGAFWVAI AGPLVSFALA LLLLISQLWW PAGSPAQVLS LNLGRLNFIL AVFNLIPGLP LDGGQVLKAI AWKVTGDRYR AVHWAANSGR ILSAIAMAIG LFSWFLGPGG FSGVWLALLG WFGWRNATAY DRTTTLQQAI LAIGASEAMS RRYRVLEGSL TLRQFAELLI TEEQEGFAYF VASDGRYRGR ISLATLRQTE RSQWDRLTLT DLAEPFDRLP ALPETANLAQ AIAALQTAQP SYVTVLTPSG AVAGIIDHAD VIQALGKKLG FQLPPAELQQ IRGRAAYPDG LPLEMLAQSV LNN
|
| |