Gene Synpcc7942_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0047 
Symbol 
ID3775760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp46515 
End bp48881 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content51% 
IMG OID637798453 
ProductTPR repeat-containing protein 
Protein accessionYP_399066 
Protein GI81298858 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.210348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTATT TTTATCTAGC TGAAGCTGGG GATTGGTCAT TTCGAACGGT TTGGCACGAT 
CGCTTCTTTA CTGTTCTCAA ACATCATCCG GCACGGACTG AAAAGGTTGA GCAGGCAGAC
TATATTTTTC TGGATGCAGA TTTTGCCCTA GAGACCAACT GGCCGTATTA CGGATCGCAA
CGTAAGGCGA TTTTAAAGGG CGATCGCTTT CCAACAGATG AAGAAATTTT AGCGTTTCTG
TTTGATCATC CGGTCAGTGG CTATGGCAAG CCCTTGGTGC TAATCAATAT GAATCCATTG
GCACAGTGGC CAGCGATCGC ACAGGTATTG CCGGATGTGG TGGTGGTGAG TCATTGCCAT
ACACTGGGAA ACTATCGCGT TGGCATTGAT GTGAGTTTTC CTCCGATGCC ACTCTTAGAT
CAGCATTGCT ATCCAAGCCG AGATCGCTCA ACTTTACTCA GTTTTCGGGG AGCTAATAGT
CATCCGGTGC GGGAGCAATT ACAACGGCTT CATCAGCCAC CCGAGATTGC AGCTGAGCTG
ATCCAGCAAA GCTATTGGGG GACGTTGAAT TATGTGGATG AGGCTGAGGG ACTAAGCGCT
GAGCAACAAG TCTATACAGA TTTAATTGCC CGATCGCGCT TTTCTGTAGC TCCACGTGGC
CATGATATTT TTTCCTATCG CTTGCTAGAG GTGATGGCAG GCGGGGCGAT TCCGGTCATT
TTGGCTGATG ATTGGGTGCT ACCGTTTTCG GAGTTGCTGG ATTGGTCAGA GTTTTCGCTA
TCTGTTGCTG AAGATCGCTG TTGGGAATTA CCGCAGCTCC TTCAAGCGAT TTCAACGGAT
CAATGGCAGG TGATGCAGCA GCATTTGCAA CAGGTTTATC AGCACTATTT CTATTCTTTG
GCACGACAGG TGCAGACGCT GTGGCAAATC TTGGATCAGC GATCGCTTCA TCCTTCTGCG
TCAGAGGTAG AGATTGAGGC GGTGTTGTTG TCTCAGGCGG AACGGTATCG ACTTAAGGGA
GATCTGGAAG CAGCCGCAAC TTATTTGGCA ACACTGCCCC GGTCGCCAGC TCGGATTTTA
GAGCAAGCAC GCTTAGCGTT GACGCAACAG CAACCCAAAG CAGCATTGGC CTTACTGGAT
TCCATGGTGA TGCCGGAGGC AGAGCAGGGA GAGCATTACA ACTTGCTGGG GGTGGCGCAA
ACGCAACTCG GGGAATGGGA TACGGCGATC GCAATCTATC GACAGGGATT GCAGGCACAG
CCCCACCATC CAAAATTGCG CACCAATCTT TGTGTGGCGT TGCGGCAGCA GGAACAGTGG
GAAGAGGCAT TAGCTCTTAG TCAAGCACTA CTAGAAGAGG TACCGGCAGC GATCGATCGC
CTACTGCTAC AAGCTGATAC GCTCAATTTG GCAGGTCGGT ATGATCAGGC GCTATCGCTA
TATCAGCAGG TGGTGGCGCG AGAACCAGAA CGAGCAAATG CTCAGCTGGC GATCGCAGAA
ATTTTGTTGC GGCAGGGCAA GGCAGAAGGT TGGGATATCT ATGAAGCTCG CTTTGCTGCG
GAGCCGAGTT TGGCAGCGTT GGCGGCGCAT TATCCGCAAC CACGATGGCA GGGGGTAGAG
CTGGGTCAGC GATCGCTATT GGTGTGGGGA GAGCAGGGCT ATGGCGATCA AATTCAGTTC
AGTCGCTACT TGTGGGTGCT GCGCGATCGC TATCCGCAGG CTCGTATTCA GTTCCAGACG
GATGCGGTAC TAGTGCCGTT GTTTGCGGAG CCGCTGGCAA GTTTGGGCAT TAAGGTGATT
CCGGCACAGA TAACGGAAGA ATTATTTGAT TTTCAGGTGC CGTTGCTGTC GTTGCCTCGG
TTGGTGTGGC CGAGCTTAAA AGATATTCCC TATCGCGAGG GTTGGTTGCC TTGTCCGTTG
CCACCGCCTC GTGAGGATCA AAACCACAAG TTTCGGGTTG GAATTGTTTG GCTGGCGGGT
CAGCGGGCAG GAATACAAAA GAGTGCGACA GCGGATCGGC GCAGCTGTTC GCTAGAAGCA
ATGTTGGAGC TGGTGCGAGA AAGTATGCAG AGGTCAGATC TGGAAGTGGT GAGTCTGCAG
TTGGGGTATG AGGGTAGGTT ACCGGAAGGG ATTCAGGATT GGAGCAATCG CTTGGTGGAT
TTTTCAGCAA CGGCTCGGGT GTTGATGGAA TTGGATCTAC TGGTGACGGT GGATACTGCG
ATCGTGCATT TGGCAGGGGC AATGAGAACA CCAGCAAAAG TTCTTCTTGC TTACCCAGCA
GATTGGAGAT GGCAGCAAAA CTTGGAATTA ACTTGGTACT CTTCTATAGA GCTTTGTTAC
TTTGCCCATC TGCTTTCAAG TTTTTAG
 
Protein sequence
MAYFYLAEAG DWSFRTVWHD RFFTVLKHHP ARTEKVEQAD YIFLDADFAL ETNWPYYGSQ 
RKAILKGDRF PTDEEILAFL FDHPVSGYGK PLVLINMNPL AQWPAIAQVL PDVVVVSHCH
TLGNYRVGID VSFPPMPLLD QHCYPSRDRS TLLSFRGANS HPVREQLQRL HQPPEIAAEL
IQQSYWGTLN YVDEAEGLSA EQQVYTDLIA RSRFSVAPRG HDIFSYRLLE VMAGGAIPVI
LADDWVLPFS ELLDWSEFSL SVAEDRCWEL PQLLQAISTD QWQVMQQHLQ QVYQHYFYSL
ARQVQTLWQI LDQRSLHPSA SEVEIEAVLL SQAERYRLKG DLEAAATYLA TLPRSPARIL
EQARLALTQQ QPKAALALLD SMVMPEAEQG EHYNLLGVAQ TQLGEWDTAI AIYRQGLQAQ
PHHPKLRTNL CVALRQQEQW EEALALSQAL LEEVPAAIDR LLLQADTLNL AGRYDQALSL
YQQVVAREPE RANAQLAIAE ILLRQGKAEG WDIYEARFAA EPSLAALAAH YPQPRWQGVE
LGQRSLLVWG EQGYGDQIQF SRYLWVLRDR YPQARIQFQT DAVLVPLFAE PLASLGIKVI
PAQITEELFD FQVPLLSLPR LVWPSLKDIP YREGWLPCPL PPPREDQNHK FRVGIVWLAG
QRAGIQKSAT ADRRSCSLEA MLELVRESMQ RSDLEVVSLQ LGYEGRLPEG IQDWSNRLVD
FSATARVLME LDLLVTVDTA IVHLAGAMRT PAKVLLAYPA DWRWQQNLEL TWYSSIELCY
FAHLLSSF