Gene Cyan8802_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3620 
Symbol 
ID8392962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3693565 
End bp3696417 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content40% 
IMG OID644981549 
ProductTetratricopeptide domain protein 
Protein accessionYP_003139271 
Protein GI257061383 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAA AATACCCTAA GCTCAAACAA ATATTTAGAT ATTTGAGCAA AAAATTCTAC 
AATATAAGAG CCTTGCTTGT ATTGATTTTG CTTTTTTTTA TCTCTTTAAT AGCCCCTATT
GTAATAACGC AAGCTGCCCT TTCCTATTCA AGCAATCCAC TTGAATTAAT ACAAAAAGGA
CAACAATTAT ATCAATCACA ACAATTCTCA GAAGCAGTAA AAATTTGGCA ACAAGCAGCC
GATCTTTTTC GTGAACAAGG GGATGTATTA AACCAAAGTA TGTCTTTGAG TAATGTATCC
TTAAGTTATC AACAGTTAGG AGAATGGGAG GAAGCAAAAA CCGCGATCGC TCAAAGTTTA
TCCCTCCTAG ATAATCAAGA AAAGACGACC GTACAGCAGG GAATTTTGGC AGCCAGTTTA
GACATTGACG GACAGTTAAA ATTAGCATTA GGACAACCCC AAAATGCCCT TGAAACTTGG
CAGAAAGCTG CTAGTATTTA TCAAAAAAAT AGTCATCAAA CCCAAGTTAT TCAAAGTCTA
ATTAACCAAG CTCAAGCGAT GCAACAGTTA GGATTTTATC CAAGAGCTTG TCAGACCTTA
TTAGACGCTT TAAACATCAA TAGTCAAGAC TGTCAAATAT CCGAAGAAAA ACTACAACTT
ATTCCCCAAG AAAATATAGC AATTTCTTTA CAAATATTGG GATTGCGGAG TCTAGGGAAT
GTCTTACGAG TAACGGGACA AACTAAACAA TCTCAAAAGG TTTTATTAAA AAGTTTCCAA
TTAGCCCAAC CTCAAAAAGA TCCTGAAATC TTAGCCACTA TTGCCCTGAG TTTAGGGAAT
ACTGCTCAAA TTTTAGGCAA CCAAACCCCT GCGAGAAATC CCCAACCAAG AACAATAGAG
TTACGCAGTG AAATCAGTTG TATTCCCTCG CAAACCTATC AAACCTCTGA AGAATTTTAT
CAACAAGCGA TCGCTTGCTA TCGTCAAGCT CAAATAGGAA GTAGTTCGCT AACTAAAATA
CAAGCACAGC TAAATTTATT AAGTTTATTA ATCCAACAAC AGCAAGGAAA AGAAATTCCT
CTTTTAATTA ATCAAATTGA AGACAAACTA ACGGTATTAC CATCCAGTCA AAAAACGATT
TTAATTAAAC TAAAATTTGT ACAACAATTA ATGTGTCTTC AAAATTCTTT TCAGGCTAGT
TTCAATCAAC TAACCCCACC AATTTTACAA TCTTGTTCCA TCGTCAAACC AGGTTTAAAA
AGTCATCTGA CCCAAGAACA GATATCCTCT TGGTTAACCA TCCAAAACCT TCTAGAAACA
ACCCTTAACC AAGCAAAAAA CATCGAGCAT GACCCCTCTC AAGCGAATGT TTTAGGGTAT
TTAGGTGCCA GCTATCAACT AATAGGAAAT TTAACAAAAG CACAAGAGTT AACCGAACTT
GCCTTGCAAA AGGTTTCTGG CTTTAATTAT CCTGAAATCG CCTACTTGTG GCAATGGCAA
CTAGGGAGAT TGTACCAACT CCAAGGAGAA AATACCAAAG CGATCGCAGC CTATAGGTTA
ACCGTGGATA TCCTTGAGTC GCTACGACAG GACTTAGTGG TGACGAATAC AGACCTTCAA
TTTGACTTCC GTGATAGCGT AGAACCGGTC TATCGGGAAC TGGTGGCTCA ACTCTTGCAA
CCGTCTTCTA ATCAACCGGA AAACCCAGAA AAAATTAGTC AGGAAAACCT AAAAAAAGCC
CGCGACCTTA TAGAATCACT ACAATTAGCT GAACTCAATA ACTTCTTTCG CGAAGCCTGT
ATTGAGGCGC AACCACAACA AATTGAGCAA ATTGATCCCC ACGCAGCCGT TATCTATTCC
ATTGTTTTAC CCGAACGTCT GGCGGTAATT TTATCGGTTC CGAACCAGCC CTTAAGCTAC
CACGAAACAA CCCTCAATAA GCAATTCAAT CAGACTTCAT CGAGGGAAAT AGAAGAAGTT
TTTGATGAGA TGTTTGCCAA CTTAAACCCC TTTATTCCTA GTCCCGACCC CCTTCGTCCC
CATCAACAAT TTTATAATTG GCTCATTCGT CCCTTAGAAA CAGAATTAAA AGAAAATAAC
ATTAAAACTT TAGTTTTTGT TCTCGATGGA CTGTTACGAG GTGTACCCAT GGCTGCCCTA
CACGATGGGA AGGAGTATTT AATTGAAAAA TACCGTATTG CCCTGACTCC GGGGTTACAA
TTATTAAGTC CGCGATCGCT GTCTCAAGAA AAGTTAAAAA TCCTGGCCGG TGGATTAGCA
GAAGCCCGTC AGGGTTTCTC CGCCTTGCCT GGAGTCACCC AAGAGGTTAA AGACATCTCT
GAAATTCTCC CCGCAGAAAT CCTCTTAAAT CAAGAATTTA CTCGCCCGCG TCTACAAACC
GAAATTGAGT CCACGTCCTT TCCCATTGTT CATCTGGCAA CCCATGGTCA GTTTTCTTCC
CAAGTTGAAG AGACGTTTTT GCTGACATGG GATGAACGGA TTAATGTGAA AAACTTAGAT
CAATTGTTAC GCGAACGGGA AGAAAAACAG CAAACTCCGA TCGAAATGTT GATTTTAAGT
GCTTGTCAGA CTGCTACTGG CGACAAACGA GCCGTATTAG GATTAGCCGG GGTTGCAGTG
CGTTCAGGAG CCCGCAGTAC CATCGCCACC CTTTGGTCAG TCCAAGACCA GTCTACGGCG
AATTTGATGG CCGAGTTCTA TAAAATCTTA AATCAACCCG GAATCACTAA AGCTGAGGCA
CTACGTCAAG CACAACTGTC TTTATTGCAC TCTGGGGAAT ATCACCATGC TTTCTATTGG
GCTCCCTTTG TTTTAGTAGG AAATTGGTTA TAA
 
Protein sequence
MQKKYPKLKQ IFRYLSKKFY NIRALLVLIL LFFISLIAPI VITQAALSYS SNPLELIQKG 
QQLYQSQQFS EAVKIWQQAA DLFREQGDVL NQSMSLSNVS LSYQQLGEWE EAKTAIAQSL
SLLDNQEKTT VQQGILAASL DIDGQLKLAL GQPQNALETW QKAASIYQKN SHQTQVIQSL
INQAQAMQQL GFYPRACQTL LDALNINSQD CQISEEKLQL IPQENIAISL QILGLRSLGN
VLRVTGQTKQ SQKVLLKSFQ LAQPQKDPEI LATIALSLGN TAQILGNQTP ARNPQPRTIE
LRSEISCIPS QTYQTSEEFY QQAIACYRQA QIGSSSLTKI QAQLNLLSLL IQQQQGKEIP
LLINQIEDKL TVLPSSQKTI LIKLKFVQQL MCLQNSFQAS FNQLTPPILQ SCSIVKPGLK
SHLTQEQISS WLTIQNLLET TLNQAKNIEH DPSQANVLGY LGASYQLIGN LTKAQELTEL
ALQKVSGFNY PEIAYLWQWQ LGRLYQLQGE NTKAIAAYRL TVDILESLRQ DLVVTNTDLQ
FDFRDSVEPV YRELVAQLLQ PSSNQPENPE KISQENLKKA RDLIESLQLA ELNNFFREAC
IEAQPQQIEQ IDPHAAVIYS IVLPERLAVI LSVPNQPLSY HETTLNKQFN QTSSREIEEV
FDEMFANLNP FIPSPDPLRP HQQFYNWLIR PLETELKENN IKTLVFVLDG LLRGVPMAAL
HDGKEYLIEK YRIALTPGLQ LLSPRSLSQE KLKILAGGLA EARQGFSALP GVTQEVKDIS
EILPAEILLN QEFTRPRLQT EIESTSFPIV HLATHGQFSS QVEETFLLTW DERINVKNLD
QLLREREEKQ QTPIEMLILS ACQTATGDKR AVLGLAGVAV RSGARSTIAT LWSVQDQSTA
NLMAEFYKIL NQPGITKAEA LRQAQLSLLH SGEYHHAFYW APFVLVGNWL