Gene Cyan8802_3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3914 
Symbol 
ID8393264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4029995 
End bp4031095 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content38% 
IMG OID644981839 
ProductWD-40 repeat protein 
Protein accessionYP_003139553 
Protein GI257061665 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.152653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.245593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAT TAGTTAAAAA ATTTCAATTT ATCTATTTAT TTTACTTAGT CGCAAATATT 
AGTTTAATAG CAACATTAGT CTATGAGATT TTAACGCCAA CAACCATTGC TAGAGAGATT
TACCTTGAAG AAATTGCCCA AAGTTCTCAA CCGCCAATAA TGGTTAAAGA TATCCAAGGA
TTTAAGGGGG TTATCAAAGC TCTGACGATG ACTCCCGATG GCAAAATCTT GTTAGTTGGT
GCGGGGGATG CAACCCTTAA TGCAGTTGAT CTTGAACTCG AACAAGTCGT TTATTCTAAA
ACCCATAAAA TCAATGATTA TTCATCAATT GTGGTGACAT CTCAGCCAAC ATTTCTTGAT
GAAACGACAT CTAATGAAAC GACCTCTGAT GAGACTCCAT TAACTGGACC AATGTTAGCA
TTAGCGGATG ATGAAAACAT CAGAGTTTTG AGTTTAGTAG ATGGCAGTAA AGTCAACCTT
TTAAAAGGAC ATAGTGGAAA AATTAGTGAT TTAGCCCTCA GTCCTGATGA TAAAATACTG
GTTAGTGTTA GTGCTAGCGA TCGCACCATT CGGATTTGGG ATTTTGCAAC CGGGAATTTA
ATTGAAACCT TAGGGGTAGA CATTGGACCG ACGAATAATG TCGCGTTTAC TCCCGATGGA
ATGACGTTTG TCACGGGAGC TATTGGCGAT GATCGCACCT TAAAATTTTG GGATCTCCCT
ACCTTAGAAT TGATCCGATC TTCTCCCCAA CAACCCGGCT ATATTAACGA TCTCAAGATT
ACTCCCGATG GCAAAAAATT AGTAGCTGCG GTGAGAAATT ATATCAAAGT TTGGGACTTA
ACCACGGGGA AAGAACTCTT AAATATTAAA GGACCCAGGT TAGACATTAA TGCGATCGCT
ATTTCTCCAG ATAGTCGCGT AGTTGCCACT GCCAACAAAG AAGGAAATAT TATGCTTTTT
GATCTCACAA AAGGTCGTAA ATTAACGACC TTAGAAGGAC ATAAAGGATG GGTTCTTTCT
TTAGTTTTTA GTCCCGATGG ACGCTATCTT TATAGTGGGG CTGAAGATAA AATTATTAAA
ATTTGGCAAC TCCGTGCTTA A
 
Protein sequence
MNKLVKKFQF IYLFYLVANI SLIATLVYEI LTPTTIAREI YLEEIAQSSQ PPIMVKDIQG 
FKGVIKALTM TPDGKILLVG AGDATLNAVD LELEQVVYSK THKINDYSSI VVTSQPTFLD
ETTSNETTSD ETPLTGPMLA LADDENIRVL SLVDGSKVNL LKGHSGKISD LALSPDDKIL
VSVSASDRTI RIWDFATGNL IETLGVDIGP TNNVAFTPDG MTFVTGAIGD DRTLKFWDLP
TLELIRSSPQ QPGYINDLKI TPDGKKLVAA VRNYIKVWDL TTGKELLNIK GPRLDINAIA
ISPDSRVVAT ANKEGNIMLF DLTKGRKLTT LEGHKGWVLS LVFSPDGRYL YSGAEDKIIK
IWQLRA