Gene Information |
Locus tag | PCC8801_1074 |
Symbol | |
ID | 7105066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1128685 |
End bp | 1130190 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643474166 |
Product | protein of unknown function DUF1555 |
Protein accession | YP_002371304 |
Protein GI | 218245933 |
COG category | [S] Function unknown |
COG ID | [COG4222] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
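The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(1128685, 1130190)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1506 bp) and Protein Length (501 aa).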
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAACTG CAAAACGCTT CTTTTCTGCA TTAATTAGTA CCTTGACGTT ATCGGTTTTA ACAGATCAAA CGGCTCACGC TGTTTCCTTT GTTAATAACA TTGTTATTTC GTCTGATCAA ACCGATTTAA GTGGTGAACC CGATGGACTA AATGGTAATC GCTTAGCTGG AATTTTTTCT GATCTTTATT ATGATCGTAG CAATAATGTT TATTATGGTT TAAGCGATGC TGGTCCCGGT GGTGGAACTG TTTCTTTTAA CACCAAAGTT CAAAAATTCA CCCTAGATGT TAATCCTAAT ACTGGGGAAA TTAGCAATTT TAATTTACTC GATACGATTC TATTTACTGA CAATGGTCAA AACTTGAATG GATCAGATCC TAGCTTCCTC AATGGAGATA GTTCAGTTCT GGGATTAAGT TTTGATCCCG AAGGGTTTGT CGTGGCTCCT AATGGTCATT TTTACGTCTC AGATGAATAT GGACCCTCTG TCTATGAATT TTTGCCAGAT GGGTCATTTT TACGCGCTTT AACGACTCCT GATAACCTGA TTCCTAAAAA CAATACTACC CCCAATTATG TTGATGGACG TGGTACGATT ACCACAGGTC GTCAGGATGG TCGTGGATTT GAAGGATTAG CAATTAGTCC TGATGGGACA CAATTATTTG CGATGTTACA AGCTCCTTTA GTTAACGAAG GAAACGCTAA TGATGGACGA CTTAGTGCTA ATTTAAGAAT TGTTGAATTT GATACCACCA CAGGAACCAG TACCGCTCAA TATATTTATC AGTTAGAAAG TTTGATTGAT ATTAATAATC GTATCCCTGG AACCAGCAAT GATTTTCCAG CAACCAGTCA AGGAAGAAAC ATTGGAATTA GTGCCATTAC TGCGATTAAT GAGGCAGAAT TTTTGGTAAT AGAAAGGGAT AATCGAGGGT TTGGGGTTGC TGCACCAACG ACTACTGATA TTGCTGATAA TCCTGTGGGA ACTAAGCGAG TTTATCACAT TGATATCACT GGAGCAACGG ATGTTAGTGG TCTTAGTTTA GCAGGAACAA GTACGTTACC AGGTGGGGTA ATTCCTGTAA CAAAATCGCT GTTTCTCGAT CTCCAAAGTG AATTAGAAAC GGCCGGACAA TTGGTCACAG AAAAACTAGA AGGATTAGCC ATTGGACCCC AATTAAATGA TGGAAGTTAC GCTCTTTTAG TGGGAACAGA TAATGATTTT AGTGCGACTC AAGATAGTAA TGATGTTCAA TTTGATGTCT GTACTAATGC GTTGACAACT AATCCTCTTG CTGAATCTCA ACAAGTTCCG ATTAATACTC CGTGTCCTCT CGATTCCCAA AACAATCCCA TGAGTTTAAT TCCCACCTAT CTCTATTCTT TCAAAGCAGA TGTTCCTAAT TTTGTCCCCT TACAAACTGT TCCAGAACCC AGTGTAATCC TCGGAATAAT TAGCTTAGGG TTAGGTGGGT TGCTTCTTAA AAAAACTAAT ACTTAA
|
Protein sequence | METAKRFFSA LISTLTLSVL TDQTAHAVSF VNNIVISSDQ TDLSGEPDGL NGNRLAGIFS DLYYDRSNNV YYGLSDAGPG GGTVSFNTKV QKFTLDVNPN TGEISNFNLL DTILFTDNGQ NLNGSDPSFL NGDSSVLGLS FDPEGFVVAP NGHFYVSDEY GPSVYEFLPD GSFLRALTTP DNLIPKNNTT PNYVDGRGTI TTGRQDGRGF EGLAISPDGT QLFAMLQAPL VNEGNANDGR LSANLRIVEF DTTTGTSTAQ YIYQLESLID INNRIPGTSN DFPATSQGRN IGISAITAIN EAEFLVIERD NRGFGVAAPT TTDIADNPVG TKRVYHIDIT GATDVSGLSL AGTSTLPGGV IPVTKSLFLD LQSELETAGQ LVTEKLEGLA IGPQLNDGSY ALLVGTDNDF SATQDSNDVQ FDVCTNALTT NPLAESQQVP INTPCPLDSQ NNPMSLIPTY LYSFKADVPN FVPLQTVPEP SVILGIISLG LGGLLLKKTN T
|
| |
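The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the method used by the source database. It builds the codon table from the standard NCBI base ordering (TCAG); translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons, which this sketch does not model. The `translate` helper is hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGGAAACTG CAAAACGCTT CTTTTCTGCA"))
```

The output corresponds to the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be expected to stop at the terminal TAA codon.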