Gene PCC8801_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1074 
Symbol 
ID7105066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1128685 
End bp1130190 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content39% 
IMG OID643474166 
Productprotein of unknown function DUF1555 
Protein accessionYP_002371304 
Protein GI218245933 
COG category[S] Function unknown 
COG ID[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACTG CAAAACGCTT CTTTTCTGCA TTAATTAGTA CCTTGACGTT ATCGGTTTTA 
ACAGATCAAA CGGCTCACGC TGTTTCCTTT GTTAATAACA TTGTTATTTC GTCTGATCAA
ACCGATTTAA GTGGTGAACC CGATGGACTA AATGGTAATC GCTTAGCTGG AATTTTTTCT
GATCTTTATT ATGATCGTAG CAATAATGTT TATTATGGTT TAAGCGATGC TGGTCCCGGT
GGTGGAACTG TTTCTTTTAA CACCAAAGTT CAAAAATTCA CCCTAGATGT TAATCCTAAT
ACTGGGGAAA TTAGCAATTT TAATTTACTC GATACGATTC TATTTACTGA CAATGGTCAA
AACTTGAATG GATCAGATCC TAGCTTCCTC AATGGAGATA GTTCAGTTCT GGGATTAAGT
TTTGATCCCG AAGGGTTTGT CGTGGCTCCT AATGGTCATT TTTACGTCTC AGATGAATAT
GGACCCTCTG TCTATGAATT TTTGCCAGAT GGGTCATTTT TACGCGCTTT AACGACTCCT
GATAACCTGA TTCCTAAAAA CAATACTACC CCCAATTATG TTGATGGACG TGGTACGATT
ACCACAGGTC GTCAGGATGG TCGTGGATTT GAAGGATTAG CAATTAGTCC TGATGGGACA
CAATTATTTG CGATGTTACA AGCTCCTTTA GTTAACGAAG GAAACGCTAA TGATGGACGA
CTTAGTGCTA ATTTAAGAAT TGTTGAATTT GATACCACCA CAGGAACCAG TACCGCTCAA
TATATTTATC AGTTAGAAAG TTTGATTGAT ATTAATAATC GTATCCCTGG AACCAGCAAT
GATTTTCCAG CAACCAGTCA AGGAAGAAAC ATTGGAATTA GTGCCATTAC TGCGATTAAT
GAGGCAGAAT TTTTGGTAAT AGAAAGGGAT AATCGAGGGT TTGGGGTTGC TGCACCAACG
ACTACTGATA TTGCTGATAA TCCTGTGGGA ACTAAGCGAG TTTATCACAT TGATATCACT
GGAGCAACGG ATGTTAGTGG TCTTAGTTTA GCAGGAACAA GTACGTTACC AGGTGGGGTA
ATTCCTGTAA CAAAATCGCT GTTTCTCGAT CTCCAAAGTG AATTAGAAAC GGCCGGACAA
TTGGTCACAG AAAAACTAGA AGGATTAGCC ATTGGACCCC AATTAAATGA TGGAAGTTAC
GCTCTTTTAG TGGGAACAGA TAATGATTTT AGTGCGACTC AAGATAGTAA TGATGTTCAA
TTTGATGTCT GTACTAATGC GTTGACAACT AATCCTCTTG CTGAATCTCA ACAAGTTCCG
ATTAATACTC CGTGTCCTCT CGATTCCCAA AACAATCCCA TGAGTTTAAT TCCCACCTAT
CTCTATTCTT TCAAAGCAGA TGTTCCTAAT TTTGTCCCCT TACAAACTGT TCCAGAACCC
AGTGTAATCC TCGGAATAAT TAGCTTAGGG TTAGGTGGGT TGCTTCTTAA AAAAACTAAT
ACTTAA
 
Protein sequence
METAKRFFSA LISTLTLSVL TDQTAHAVSF VNNIVISSDQ TDLSGEPDGL NGNRLAGIFS 
DLYYDRSNNV YYGLSDAGPG GGTVSFNTKV QKFTLDVNPN TGEISNFNLL DTILFTDNGQ
NLNGSDPSFL NGDSSVLGLS FDPEGFVVAP NGHFYVSDEY GPSVYEFLPD GSFLRALTTP
DNLIPKNNTT PNYVDGRGTI TTGRQDGRGF EGLAISPDGT QLFAMLQAPL VNEGNANDGR
LSANLRIVEF DTTTGTSTAQ YIYQLESLID INNRIPGTSN DFPATSQGRN IGISAITAIN
EAEFLVIERD NRGFGVAAPT TTDIADNPVG TKRVYHIDIT GATDVSGLSL AGTSTLPGGV
IPVTKSLFLD LQSELETAGQ LVTEKLEGLA IGPQLNDGSY ALLVGTDNDF SATQDSNDVQ
FDVCTNALTT NPLAESQQVP INTPCPLDSQ NNPMSLIPTY LYSFKADVPN FVPLQTVPEP
SVILGIISLG LGGLLLKKTN T