Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1103 |
Symbol | |
ID | 8390414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 1128859 |
End bp | 1130364 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644979118 |
Product | protein of unknown function DUF1555 |
Protein accession | YP_003136869 |
Protein GI | 257058981 |
COG category | [S] Function unknown |
COG ID | [COG4222] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.428306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.970899 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACTG CAAAACGCTT CTTTTCTGCA TTAATTAGTA CCTTGACGTT ATCGGTTTTA ACAGATCAAA CGGCTCACGC TGTTTCCTTT GTTAATAACA TTGTTATTTC GTCTGATCAA ACCGATTTAA GTGGTGAACC CGATGGACTA AATGGTAATC GCTTAGCTGG AATTTTTTCC GATCTTTATT ATGATCGTAG CAATAATGTT TATTATGGTT TAAGCGATGC TGGTCCCGGT GGTGGAACTG TTTCTTTTAA CACCAAAGTT CAGAAATTCA GCCTAGATGT TAATCCTAAT ACTGGGGAAA TCAGTAATTT CAATTTACTC GATACGATTC TATTTACTGA CAATGGTCAA AACTTGAATG GATCAGATCC TAGCTTCCTC AATGGGGATA GTTCATTTCT GGGTTTAAGT TTTGATCCCG AAGGTTTTGT CGTGGCTCCT AATGGTCATT TTTACGTCTC AGATGAATAT GGACCCTCTG TCTATGAATT TTTGCCAGAT GGGTCATTTT TACGCGCTTT AACGACTCCT GATAACCTGA TTCCTAAAAA CAATACTACC CCCAATTATG TTGATGGACG TGGTACGATT ACCACAGGTC GTCAGGATGG TCGTGGATTT GAAGGATTAG CAATTAGTCC TGATGGGACA CAATTATTTG CGATGTTACA AGCTCCTTTA GTTAACGAAG GAAACGCTAA TGATGGACGA CTTAGTGCTA ATTTAAGAAT TGTTGAATTT GATACCACCA CAGGAACCAG TACCGCTCAA TATATTTATC AGTTAGAAAG TTTGATTGAT ATTAATAATC GTATCCCTGG AACCAGCGAT GATTTTCCAT CAACCAGTCA AGGAAGAAAC ATTGGAATTA GTGCCATTAC TGCGATTAAT GAGACAGAAT TTTTGGTAAT AGAAAGGGAT AATCGAGGGT TTGGGGTTGC TGCACCAACG ACTACTGATA TTGCTGATAA TCCTGTGGGA ACTAAGCGAG TTTATCACAT TGATATCACT GGAGCAACGG ATGTTAGTGG TCTTAGTTTA GCAGGAACAA GTACTTTACC AGGTGGGGTA ATTCCTGTAA CAAAATCGCT GTTTCTCGAT CTCCAAAGTG AATTAGAAAC GGCCGGACAA TTGGTCACAG AAAAACTAGA AGGATTAGCC ATTGGACCCC AATTAAATGA TGGAAGTTAC GCTCTTTTAG TGGGAACAGA TAATGATTTT AGTGCGACTC AAGATAGTAA TGATGTTCAA TTTGATGTCT GTACTAATGC GTTGACAACT AATCCTCTTG CTGAATCTCA ACAAGTTCCG ATTAATACTC CGTGTCCTCT CGATTCCCAA AACAATCCCA TGAGTTTAAT TCCCACCTAT CTCTATTCTT TCAAAGCAGA TGTTCCTAAT TTTGTCCCCT TACAAACTGT TCCAGAACCC AGTGTAATCC TCGGAATAAT TAGCTTAGGG TTAGGTGGGT TGCTTCTTAA AAAAACTAAT ACTTAA
|
Protein sequence | METAKRFFSA LISTLTLSVL TDQTAHAVSF VNNIVISSDQ TDLSGEPDGL NGNRLAGIFS DLYYDRSNNV YYGLSDAGPG GGTVSFNTKV QKFSLDVNPN TGEISNFNLL DTILFTDNGQ NLNGSDPSFL NGDSSFLGLS FDPEGFVVAP NGHFYVSDEY GPSVYEFLPD GSFLRALTTP DNLIPKNNTT PNYVDGRGTI TTGRQDGRGF EGLAISPDGT QLFAMLQAPL VNEGNANDGR LSANLRIVEF DTTTGTSTAQ YIYQLESLID INNRIPGTSD DFPSTSQGRN IGISAITAIN ETEFLVIERD NRGFGVAAPT TTDIADNPVG TKRVYHIDIT GATDVSGLSL AGTSTLPGGV IPVTKSLFLD LQSELETAGQ LVTEKLEGLA IGPQLNDGSY ALLVGTDNDF SATQDSNDVQ FDVCTNALTT NPLAESQQVP INTPCPLDSQ NNPMSLIPTY LYSFKADVPN FVPLQTVPEP SVILGIISLG LGGLLLKKTN T
|
| |