Gene Information |
Locus tag | Cyan8802_2816 |
Symbol | |
ID | 8392143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 2844953 |
End bp | 2846911 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644980768 |
Product | pentapeptide repeat protein |
Protein accession | YP_003138503 |
Protein GI | 257060615 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
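The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a coding sequence that ends in a stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2844953, 2846911)
print(length)                     # 1959
print(protein_length_aa(length))  # 652
```

Both values match the Gene Length (1959 bp) and Protein Length (652 aa) fields of this record.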
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.300726 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCAT TGCCTTCCAG TAATAATCTA CGTGGACAAG ATCTTAGAAA TGGAGATTAT GCTAGGCAAG ATTTTAGATA TAGGGACATC CGGGGGACAA ACTTTTCTAA TGCCAATCTT GAGGGCGCAG ATTTTACTGG GGCAACGGCT GGAGTAACCT ATTCCTGGGC ATTGTGTTTG CTACTTACTG CTTTATTTAT TGCTTCACTA TCTGGCTTTA CAGCTTCAAT AATTATTACT TTTACAGTTT ATTTTTTAGT GGGATCGAAG ATTCCTCGTC CTATAGCTCT ATTAATTGCA GTTAGTTTTA TATCTCTTGT TCGCGCAGGA TTAGATCACC ATTTTGGAGC TTTTGTTGTC GCCTTAAATA TTTATTTTTC AATAGCCTTA ATTTTAGCTG TTGCCCTTTT TGGAGCCACC GCAGCAGCAG TTGACGAACC TAAATCCAGT ATTGGCAGTA CGGCTTTAGC TACTTTTGCA TGTATGGCTC TTCTAATCGT AATAGTTGTG AATGTAATAT CAAATATACA GCCAGTTGGT GGAATAATTG GAGGGGTAGT AGGAGGCATA TTTGGGGGGA TAATTGGTAG TTATTTTGGT CGTAAGGCAA TAGACGGAGA TAGTAAGTTC TGCTGGATTT GGAAGCTTTA TCTTCGATTT GCCATAAAAG GTGGCACAAA ATTTCAAGGT ACTAACTTAA CCAATGCAAT CTTCACTAAT ACAATCTTGA AAGGTTCTGA GTTGAGAGAT GCAGTCATCA CAAACGCAAA TTGGAAAAGT GCTAAGTTTC TTGAATTGTC TAGATTTGAC AATTCGCTCT TAGCTAATTC TAAAGTTAGA GAGCTTCTTG TTACAGGCTT AGGTAACAAT CAGAATTATG CTGGGCTGGA TTTAAGAGGC ATAAATCTTT CACAAGCAAA GCTAAACGGG GCTAATCTTA AGAATTCTGA TATTAGTGGA TCTATATTAA TCGGAGCGGA TTTACAGTTT GCTAATCTTG CAGAGGTGAA AGCTACTAAT ACAGATTTTA CTCATGCCTT CTTAAACGGA GCTTGTATTG AGAATTGGCA TATAAACTCA AGAACAAAAT TTGAAGATGT AAAGTGCGAC CACGTTTATT TGCGTGAGAA CAAACAAGAG CGTCGTCCCT CTAACTCTGA TGATTATTTT GAGGGCAGTG AATTCATCAC TTTGGTGGAG AAATATCATG AAACGATTGA CTTGATTTTT AATAATGGAA TTGACTGGAC GGCACTACTC ACAAGTATCT ATAAGCTTCA AACCAAAAGT AGGGATAGCA ATTTGTCAAT TCAGGCAATT GAACGTAAAC AAGGTGGTTT TTTTGTTGTT CGAGTTGATG TTCCGCAGAA CCTAGATAAG GCAGAAGCAG AGAGATTTTT ATGGGACAAG TACAAAAAAA AGCTGAAAGA AATTGAAGAA AGTTATTCAG CTAGGCTTAG CCTTAAGGAT GAAAAACTGA GTGTTTATTT GCAGCAAATT AATGATTATC GTCAACAAAA CACTAGCTTG GTGGAATTAG TCAAGCAAAA AGCAGTAAGC GAAAAAATTC AGATTGAAAA TAAAATCGAG AATATAAACA TTCAACAAGG AGAACAGCAC AGTATGAGCA GTCTCAATCA TTATGGTACT GGCGATAACA TTGCGGGCGA TAAGGTTATG GGTGACAAAA TTGAGACTCA AATCAACAAC AATCAAGATT TAGCTCAGGC TTCCAAAGAC ATCAAAGCCC TGCTAGAACA GCTATCTGTG GACTATCCTA GTGACAGCCC TAGAGTTTTA 
GGAGCAAAGG CTGTGGATGA AGTTGAAAAA AATCCAGAGA TGAAGTCTCG AATTTTGCGA GGAGTTAAAG CAGGTAGTTT TGCAGCTTTA GAAAAAATGA TTGATCACCC TGTCGCCAAA TTTTTCATTG AAGGAGCAAA AGAAGTCCTG AAGCCTTGA
|
Protein sequence | MPPLPSSNNL RGQDLRNGDY ARQDFRYRDI RGTNFSNANL EGADFTGATA GVTYSWALCL LLTALFIASL SGFTASIIIT FTVYFLVGSK IPRPIALLIA VSFISLVRAG LDHHFGAFVV ALNIYFSIAL ILAVALFGAT AAAVDEPKSS IGSTALATFA CMALLIVIVV NVISNIQPVG GIIGGVVGGI FGGIIGSYFG RKAIDGDSKF CWIWKLYLRF AIKGGTKFQG TNLTNAIFTN TILKGSELRD AVITNANWKS AKFLELSRFD NSLLANSKVR ELLVTGLGNN QNYAGLDLRG INLSQAKLNG ANLKNSDISG SILIGADLQF ANLAEVKATN TDFTHAFLNG ACIENWHINS RTKFEDVKCD HVYLRENKQE RRPSNSDDYF EGSEFITLVE KYHETIDLIF NNGIDWTALL TSIYKLQTKS RDSNLSIQAI ERKQGGFFVV RVDVPQNLDK AEAERFLWDK YKKKLKEIEE SYSARLSLKD EKLSVYLQQI NDYRQQNTSL VELVKQKAVS EKIQIENKIE NINIQQGEQH SMSSLNHYGT GDNIAGDKVM GDKIETQINN NQDLAQASKD IKALLEQLSV DYPSDSPRVL GAKAVDEVEK NPEMKSRILR GVKAGSFAAL EKMIDHPVAK FFIEGAKEVL KP
|
| |
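The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string might be normalised and its GC fraction computed is shown here; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the input is only the first two blocks of the gene sequence, so the result is not expected to equal the 38% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two display blocks of the gene sequence from this record.
first_blocks = "ATGCCCCCAT TGCCTTCCAG"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Applying the same two functions to the complete gene sequence would give a value that can be compared with the GC content field.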