Gene Cyan8802_2816 details

Gene Information

Locus tag: Cyan8802_2816
Symbol:
ID: 8392143
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2844953
End bp: 2846911
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 38%
IMG OID: 644980768
Product: pentapeptide repeat protein
Protein accession: YP_003138503
Protein GI: 257060615
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
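
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2844953, 2846911)
print(length, protein_length_aa(length))
```

With the record's coordinates this reproduces the listed gene length of 1959 bp and protein length of 652 aa.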


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.300726
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCAT TGCCTTCCAG TAATAATCTA CGTGGACAAG ATCTTAGAAA TGGAGATTAT 
GCTAGGCAAG ATTTTAGATA TAGGGACATC CGGGGGACAA ACTTTTCTAA TGCCAATCTT
GAGGGCGCAG ATTTTACTGG GGCAACGGCT GGAGTAACCT ATTCCTGGGC ATTGTGTTTG
CTACTTACTG CTTTATTTAT TGCTTCACTA TCTGGCTTTA CAGCTTCAAT AATTATTACT
TTTACAGTTT ATTTTTTAGT GGGATCGAAG ATTCCTCGTC CTATAGCTCT ATTAATTGCA
GTTAGTTTTA TATCTCTTGT TCGCGCAGGA TTAGATCACC ATTTTGGAGC TTTTGTTGTC
GCCTTAAATA TTTATTTTTC AATAGCCTTA ATTTTAGCTG TTGCCCTTTT TGGAGCCACC
GCAGCAGCAG TTGACGAACC TAAATCCAGT ATTGGCAGTA CGGCTTTAGC TACTTTTGCA
TGTATGGCTC TTCTAATCGT AATAGTTGTG AATGTAATAT CAAATATACA GCCAGTTGGT
GGAATAATTG GAGGGGTAGT AGGAGGCATA TTTGGGGGGA TAATTGGTAG TTATTTTGGT
CGTAAGGCAA TAGACGGAGA TAGTAAGTTC TGCTGGATTT GGAAGCTTTA TCTTCGATTT
GCCATAAAAG GTGGCACAAA ATTTCAAGGT ACTAACTTAA CCAATGCAAT CTTCACTAAT
ACAATCTTGA AAGGTTCTGA GTTGAGAGAT GCAGTCATCA CAAACGCAAA TTGGAAAAGT
GCTAAGTTTC TTGAATTGTC TAGATTTGAC AATTCGCTCT TAGCTAATTC TAAAGTTAGA
GAGCTTCTTG TTACAGGCTT AGGTAACAAT CAGAATTATG CTGGGCTGGA TTTAAGAGGC
ATAAATCTTT CACAAGCAAA GCTAAACGGG GCTAATCTTA AGAATTCTGA TATTAGTGGA
TCTATATTAA TCGGAGCGGA TTTACAGTTT GCTAATCTTG CAGAGGTGAA AGCTACTAAT
ACAGATTTTA CTCATGCCTT CTTAAACGGA GCTTGTATTG AGAATTGGCA TATAAACTCA
AGAACAAAAT TTGAAGATGT AAAGTGCGAC CACGTTTATT TGCGTGAGAA CAAACAAGAG
CGTCGTCCCT CTAACTCTGA TGATTATTTT GAGGGCAGTG AATTCATCAC TTTGGTGGAG
AAATATCATG AAACGATTGA CTTGATTTTT AATAATGGAA TTGACTGGAC GGCACTACTC
ACAAGTATCT ATAAGCTTCA AACCAAAAGT AGGGATAGCA ATTTGTCAAT TCAGGCAATT
GAACGTAAAC AAGGTGGTTT TTTTGTTGTT CGAGTTGATG TTCCGCAGAA CCTAGATAAG
GCAGAAGCAG AGAGATTTTT ATGGGACAAG TACAAAAAAA AGCTGAAAGA AATTGAAGAA
AGTTATTCAG CTAGGCTTAG CCTTAAGGAT GAAAAACTGA GTGTTTATTT GCAGCAAATT
AATGATTATC GTCAACAAAA CACTAGCTTG GTGGAATTAG TCAAGCAAAA AGCAGTAAGC
GAAAAAATTC AGATTGAAAA TAAAATCGAG AATATAAACA TTCAACAAGG AGAACAGCAC
AGTATGAGCA GTCTCAATCA TTATGGTACT GGCGATAACA TTGCGGGCGA TAAGGTTATG
GGTGACAAAA TTGAGACTCA AATCAACAAC AATCAAGATT TAGCTCAGGC TTCCAAAGAC
ATCAAAGCCC TGCTAGAACA GCTATCTGTG GACTATCCTA GTGACAGCCC TAGAGTTTTA
GGAGCAAAGG CTGTGGATGA AGTTGAAAAA AATCCAGAGA TGAAGTCTCG AATTTTGCGA
GGAGTTAAAG CAGGTAGTTT TGCAGCTTTA GAAAAAATGA TTGATCACCC TGTCGCCAAA
TTTTTCATTG AAGGAGCAAA AGAAGTCCTG AAGCCTTGA
 
Protein sequence
MPPLPSSNNL RGQDLRNGDY ARQDFRYRDI RGTNFSNANL EGADFTGATA GVTYSWALCL 
LLTALFIASL SGFTASIIIT FTVYFLVGSK IPRPIALLIA VSFISLVRAG LDHHFGAFVV
ALNIYFSIAL ILAVALFGAT AAAVDEPKSS IGSTALATFA CMALLIVIVV NVISNIQPVG
GIIGGVVGGI FGGIIGSYFG RKAIDGDSKF CWIWKLYLRF AIKGGTKFQG TNLTNAIFTN
TILKGSELRD AVITNANWKS AKFLELSRFD NSLLANSKVR ELLVTGLGNN QNYAGLDLRG
INLSQAKLNG ANLKNSDISG SILIGADLQF ANLAEVKATN TDFTHAFLNG ACIENWHINS
RTKFEDVKCD HVYLRENKQE RRPSNSDDYF EGSEFITLVE KYHETIDLIF NNGIDWTALL
TSIYKLQTKS RDSNLSIQAI ERKQGGFFVV RVDVPQNLDK AEAERFLWDK YKKKLKEIEE
SYSARLSLKD EKLSVYLQQI NDYRQQNTSL VELVKQKAVS EKIQIENKIE NINIQQGEQH
SMSSLNHYGT GDNIAGDKVM GDKIETQINN NQDLAQASKD IKALLEQLSV DYPSDSPRVL
GAKAVDEVEK NPEMKSRILR GVKAGSFAAL EKMIDHPVAK FFIEGAKEVL KP
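
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as plain text in the 10-base blocks shown in this record, so whitespace has to be stripped first. The function name `gc_content` is illustrative and not part of any database tool.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Only the first three blocks of the gene sequence are used here; pass the
# full sequence text to compare against the GC content listed in the record.
fragment = "ATGCCCCCAT TGCCTTCCAG TAATAATCTA"
print(round(gc_content(fragment), 1))
```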