Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9902_2034 |
Symbol | |
ID | 3742994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | + |
Start bp | 1941817 |
End bp | 1943634 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637772231 |
Product | TPR repeat-containing protein |
Protein accession | YP_378035 |
Protein GI | 78185601 |
COG category | [S] Function unknown |
COG ID | [COG3898] Uncharacterized membrane-bound protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.651462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCAA CCAACCAACC ACTTCAAGCT CTCGAAGCTT CCATACAATT AGCCCGCCAA AACTATAAAG AAGCGAAAAT CGACTCAGCC CTCCAACACA TCAAAGAAGG GCTACAGATT GATGAGCACA ATGCCCAATT ACTCGAGATT GCTGCCAGCA CTCATCTTCG TTTCAATAAT AACAATGAAG CCATTAAATA TGCTCAAGAA TTAATCACAC ACCATCCAAG AAATCCGCAT GGATACAGCA GATACGCACA AGCCCTGCTG CAACAGAAAC AACCTTTTAA TGCCAATCAA ATGGCACTTA GCGGATTACA AGAAGCTCCA CACAATCAAC AGCTCCTGGC ATTAACTAGC GAATCTTATC AAGCGCTAGA GCGCTGGGAT CAATCACTAG AACTGGCAAA TCAACTTATT GAATTCCATC CAAGCCTGCA CATGGGATAC CTCAAGGCAG CACAAAACCT TCTAAAACTC AACAAACCCA ACGAGGCTTC TTCTATCGCA GAACGAGGCT TAAAAGCAAA GCCGAACCAC CCTCATCTTC ATAGCATTGC CAGCGAAGCC TACCGAGCCA GACACCTACA CAATCACTCA CTCAATCACG CAAAAGCCTT AATTGAAAAT CAACCAGACT CCATCGAAGG GCATAAACGA GCAGCCCAAG ACTTACTGAA ACTAGGGCTA AGGAATGAGG CAATTTTAAT CATCGAGCAA CTCATTCAAC GAAACAAGTC CAAGAAAGCC GCGGCAATGG CCAGCAAGCT CTTCAAAGTA GCCGGACAAA CATCAAGAAG CCTCGTTCTT ACAACACAAC TCGCCAAAGC AAACGATGCA ACAGATGATG ATCAACGGCA ATGGATTAGC AATTTATTCT CATGTAGACA GATTGATCTT GCACTATCTA AAATTGAATG CAACAACCCA ATCGATGCAG AAATACAGAG CAATACTCTT CGAGCTCTCC TCAGCCAACC ACTCAATACA TTTAAGAAAC TATCAAAGCA TGAGCGGAAC TTAATCAGAG AATACGATAT CTATTATCAC CTATCTCTTC CTAACTTTAA TCCAACTCTG GAAGATTTAG AACAACGGCT CAAATCCACC AACAAGATCA TCCTACTTGT GGTGCATGTT GGGAAATGCG CAGGCGAATC AATCATCACC GCACTAGAAG AAACCTTCAC CTCAAACGAA GTGGAAGTCA TCGAATATCA CACTTTTGAT AGCAATATGC TGATCAGAGA AAGTCTTCCA TTGCTCCACA AGCATTCCGA TCGAATCCAC ATTGTGACTT GCACTCGCAA TCCTGTGGAT CGCTGGATAT CTGCTTTCAA CTGGGACTAC CACACATTTT TCTTATCAAG CCAATTCTAT TGTCCAGACC ACATCATTCA ATTACATCGC CAATACTTCT CCGCCCTAGA ATTAACCAAT GGCTTGATGC GGAAGGAAAT AGAGGCCCAT GAATTGGCCA CATTCAAACA CCTTGCCTAT GGACACATGG CGAAAGGAAT CTCCTGGTAC CTACCAGAAG AAATTATCGA CAATCTTCCT AAGCAAGCAA TATCGACAAT CAATGTTGAA ACAATCCAAA ATGACTTCAA TCAATGCATT CATACAATCA CAACAACTTT CAAACAAATA GGTAAACGGG AACCAACACC CATCCCAAAA ACCAAGCAAA ATTATCAACA CTGGTACAAA TCAGGGGCAT TTAGTGCCGC CAGACAATTC AGCGATGCAC AAAGACAGTT CCTCGAACAA TTCCTAAATG AAGATTACAA AGTAAACAAT AAATTGAATA AGATTTAG
|
Protein sequence | MSSTNQPLQA LEASIQLARQ NYKEAKIDSA LQHIKEGLQI DEHNAQLLEI AASTHLRFNN NNEAIKYAQE LITHHPRNPH GYSRYAQALL QQKQPFNANQ MALSGLQEAP HNQQLLALTS ESYQALERWD QSLELANQLI EFHPSLHMGY LKAAQNLLKL NKPNEASSIA ERGLKAKPNH PHLHSIASEA YRARHLHNHS LNHAKALIEN QPDSIEGHKR AAQDLLKLGL RNEAILIIEQ LIQRNKSKKA AAMASKLFKV AGQTSRSLVL TTQLAKANDA TDDDQRQWIS NLFSCRQIDL ALSKIECNNP IDAEIQSNTL RALLSQPLNT FKKLSKHERN LIREYDIYYH LSLPNFNPTL EDLEQRLKST NKIILLVVHV GKCAGESIIT ALEETFTSNE VEVIEYHTFD SNMLIRESLP LLHKHSDRIH IVTCTRNPVD RWISAFNWDY HTFFLSSQFY CPDHIIQLHR QYFSALELTN GLMRKEIEAH ELATFKHLAY GHMAKGISWY LPEEIIDNLP KQAISTINVE TIQNDFNQCI HTITTTFKQI GKREPTPIPK TKQNYQHWYK SGAFSAARQF SDAQRQFLEQ FLNEDYKVNN KLNKI
|
| |