Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0047 |
Symbol | |
ID | 3775760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 46515 |
End bp | 48881 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637798453 |
Product | TPR repeat-containing protein |
Protein accession | YP_399066 |
Protein GI | 81298858 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.210348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTATT TTTATCTAGC TGAAGCTGGG GATTGGTCAT TTCGAACGGT TTGGCACGAT CGCTTCTTTA CTGTTCTCAA ACATCATCCG GCACGGACTG AAAAGGTTGA GCAGGCAGAC TATATTTTTC TGGATGCAGA TTTTGCCCTA GAGACCAACT GGCCGTATTA CGGATCGCAA CGTAAGGCGA TTTTAAAGGG CGATCGCTTT CCAACAGATG AAGAAATTTT AGCGTTTCTG TTTGATCATC CGGTCAGTGG CTATGGCAAG CCCTTGGTGC TAATCAATAT GAATCCATTG GCACAGTGGC CAGCGATCGC ACAGGTATTG CCGGATGTGG TGGTGGTGAG TCATTGCCAT ACACTGGGAA ACTATCGCGT TGGCATTGAT GTGAGTTTTC CTCCGATGCC ACTCTTAGAT CAGCATTGCT ATCCAAGCCG AGATCGCTCA ACTTTACTCA GTTTTCGGGG AGCTAATAGT CATCCGGTGC GGGAGCAATT ACAACGGCTT CATCAGCCAC CCGAGATTGC AGCTGAGCTG ATCCAGCAAA GCTATTGGGG GACGTTGAAT TATGTGGATG AGGCTGAGGG ACTAAGCGCT GAGCAACAAG TCTATACAGA TTTAATTGCC CGATCGCGCT TTTCTGTAGC TCCACGTGGC CATGATATTT TTTCCTATCG CTTGCTAGAG GTGATGGCAG GCGGGGCGAT TCCGGTCATT TTGGCTGATG ATTGGGTGCT ACCGTTTTCG GAGTTGCTGG ATTGGTCAGA GTTTTCGCTA TCTGTTGCTG AAGATCGCTG TTGGGAATTA CCGCAGCTCC TTCAAGCGAT TTCAACGGAT CAATGGCAGG TGATGCAGCA GCATTTGCAA CAGGTTTATC AGCACTATTT CTATTCTTTG GCACGACAGG TGCAGACGCT GTGGCAAATC TTGGATCAGC GATCGCTTCA TCCTTCTGCG TCAGAGGTAG AGATTGAGGC GGTGTTGTTG TCTCAGGCGG AACGGTATCG ACTTAAGGGA GATCTGGAAG CAGCCGCAAC TTATTTGGCA ACACTGCCCC GGTCGCCAGC TCGGATTTTA GAGCAAGCAC GCTTAGCGTT GACGCAACAG CAACCCAAAG CAGCATTGGC CTTACTGGAT TCCATGGTGA TGCCGGAGGC AGAGCAGGGA GAGCATTACA ACTTGCTGGG GGTGGCGCAA ACGCAACTCG GGGAATGGGA TACGGCGATC GCAATCTATC GACAGGGATT GCAGGCACAG CCCCACCATC CAAAATTGCG CACCAATCTT TGTGTGGCGT TGCGGCAGCA GGAACAGTGG GAAGAGGCAT TAGCTCTTAG TCAAGCACTA CTAGAAGAGG TACCGGCAGC GATCGATCGC CTACTGCTAC AAGCTGATAC GCTCAATTTG GCAGGTCGGT ATGATCAGGC GCTATCGCTA TATCAGCAGG TGGTGGCGCG AGAACCAGAA CGAGCAAATG CTCAGCTGGC GATCGCAGAA ATTTTGTTGC GGCAGGGCAA GGCAGAAGGT TGGGATATCT ATGAAGCTCG CTTTGCTGCG GAGCCGAGTT TGGCAGCGTT GGCGGCGCAT TATCCGCAAC CACGATGGCA GGGGGTAGAG CTGGGTCAGC GATCGCTATT GGTGTGGGGA GAGCAGGGCT ATGGCGATCA AATTCAGTTC AGTCGCTACT TGTGGGTGCT GCGCGATCGC TATCCGCAGG CTCGTATTCA GTTCCAGACG GATGCGGTAC TAGTGCCGTT GTTTGCGGAG CCGCTGGCAA GTTTGGGCAT TAAGGTGATT CCGGCACAGA TAACGGAAGA ATTATTTGAT TTTCAGGTGC CGTTGCTGTC GTTGCCTCGG TTGGTGTGGC CGAGCTTAAA AGATATTCCC TATCGCGAGG GTTGGTTGCC TTGTCCGTTG CCACCGCCTC GTGAGGATCA AAACCACAAG TTTCGGGTTG GAATTGTTTG GCTGGCGGGT CAGCGGGCAG GAATACAAAA GAGTGCGACA GCGGATCGGC GCAGCTGTTC GCTAGAAGCA ATGTTGGAGC TGGTGCGAGA AAGTATGCAG AGGTCAGATC TGGAAGTGGT GAGTCTGCAG TTGGGGTATG AGGGTAGGTT ACCGGAAGGG ATTCAGGATT GGAGCAATCG CTTGGTGGAT TTTTCAGCAA CGGCTCGGGT GTTGATGGAA TTGGATCTAC TGGTGACGGT GGATACTGCG ATCGTGCATT TGGCAGGGGC AATGAGAACA CCAGCAAAAG TTCTTCTTGC TTACCCAGCA GATTGGAGAT GGCAGCAAAA CTTGGAATTA ACTTGGTACT CTTCTATAGA GCTTTGTTAC TTTGCCCATC TGCTTTCAAG TTTTTAG
|
Protein sequence | MAYFYLAEAG DWSFRTVWHD RFFTVLKHHP ARTEKVEQAD YIFLDADFAL ETNWPYYGSQ RKAILKGDRF PTDEEILAFL FDHPVSGYGK PLVLINMNPL AQWPAIAQVL PDVVVVSHCH TLGNYRVGID VSFPPMPLLD QHCYPSRDRS TLLSFRGANS HPVREQLQRL HQPPEIAAEL IQQSYWGTLN YVDEAEGLSA EQQVYTDLIA RSRFSVAPRG HDIFSYRLLE VMAGGAIPVI LADDWVLPFS ELLDWSEFSL SVAEDRCWEL PQLLQAISTD QWQVMQQHLQ QVYQHYFYSL ARQVQTLWQI LDQRSLHPSA SEVEIEAVLL SQAERYRLKG DLEAAATYLA TLPRSPARIL EQARLALTQQ QPKAALALLD SMVMPEAEQG EHYNLLGVAQ TQLGEWDTAI AIYRQGLQAQ PHHPKLRTNL CVALRQQEQW EEALALSQAL LEEVPAAIDR LLLQADTLNL AGRYDQALSL YQQVVAREPE RANAQLAIAE ILLRQGKAEG WDIYEARFAA EPSLAALAAH YPQPRWQGVE LGQRSLLVWG EQGYGDQIQF SRYLWVLRDR YPQARIQFQT DAVLVPLFAE PLASLGIKVI PAQITEELFD FQVPLLSLPR LVWPSLKDIP YREGWLPCPL PPPREDQNHK FRVGIVWLAG QRAGIQKSAT ADRRSCSLEA MLELVRESMQ RSDLEVVSLQ LGYEGRLPEG IQDWSNRLVD FSATARVLME LDLLVTVDTA IVHLAGAMRT PAKVLLAYPA DWRWQQNLEL TWYSSIELCY FAHLLSSF
|
| |