Gene PCC8801_0042 details

Gene Information

Locus tag: PCC8801_0042
Symbol:
ID: 7105309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 44334
End bp: 45485
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 38%
IMG OID: 643473158
Product: polysaccharide biosynthesis protein
Protein accession: YP_002370305
Protein GI: 218244934
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3754] Lipopolysaccharide biosynthesis protein
TIGRFAM ID:
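
The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon. Both assumptions fit the numbers shown but are not stated by the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that includes its stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(44334, 45485)
print(length)                     # 1152
print(protein_length_aa(length))  # 383
```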


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTAA ACTCAGAACC TATTAATTCT TCAGAATCCC CATCTGTTTA TTCCGAATCT 
TTTGTTGATA ATATTCGTTT AATTGCTTTC TATTTACCTC AATTTCATCC TATTCCTGAA
AACGATCAAT GGTGGGGCAA AGGATTTACT GAATGGACGA ATGTTACTAA AGCTAAACCA
CAGTTTCCAG GGCATTATCA GCCCCATTTA CCAGCCGATT TAGGCTTTTA TGATTTACGC
CTTCGAGAAG CTCGACAAGC ACAAGCAGAC TTAGCTAGGG AATATGGAAT TTACGGCTTT
TGTTATTATC ATTATTGGTT TAATGGTCAA CGAATTTTAG AACGTCCCTT TAATGAAGTG
TTGCAATCAG GAGAGCCAAA TTTCCCCTTT TGTTTGTGTT GGGCTAATGA AAGTTGGACA
AGAAGATGGG ATGGACAAGA GCAAGAAATT TTGATGAAAC AGGTTTACAC AGAGCAAGAT
GATCAACAAC ATATTCGTTA TTTAGCTGAA GCTTTTCAAG ACCCAAGATA CATTCGGGTT
AAGGGAAAAC CCTTATTTTT AGTTTATCGT GCTAATCAAC TACCTAACCC CTTGAAAACT
ACTGAAATTT GGCGAGAAGA AGCCCAAAAG TTAGGCGTAG GAGAAATCTT TTTGGCTAGG
GTTGAAAGCT TTTTAGATGA ACACAATGAT CCTCGAAAAA TCGGATTTGA TGCAGCCGTT
GAATTTCAAC CAGATTGGGG AAAACTCGGC AAAAAATTGC AATCACGAAA GCGTTGGGAA
ATTGCTAGAA AATATGGGTT AGCTCATCAA TCGTATGGGA TTCATAATAT CTTTGACTAT
CAAACGATGG TTACCCGAAT GCTTTCCAAA CCTATTGTTA ATTATCCACG ATTTCCTGGT
GTTACTCCAT CTTGGGATAA TACAGCACGT CGTCAAGTTG CTGCAACTAT TTTGAAAGAT
TCTACCCCTG AAATTTACGA ATATTGGCTC AAAGCAGTTA TTGAAAAAAC AATCTCCAAA
CCGGAACTTC CTCCTATCAT TTTTATCAAC GCTTGGAATG AATGGGCTGA GGGAAATCAT
TTAGAACCCT GTCAACGGTG GGGAAGGAGT TATTTAGAAG CAACCCAACG AGCCATTAAA
CAATTTTCGT AG
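
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be stripped before any computation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the Gene Information section. The helper names are hypothetical, and only the first display line is used as input, so its GC value is not expected to equal the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, copied with its block spacing.
first_line = "ATGACCTTAA ACTCAGAACC TATTAATTCT TCAGAATCCC CATCTGTTTA TTCCGAATCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1152 bp sequence through the same two functions would give the value to compare against the reported GC content.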
 
Protein sequence
MTLNSEPINS SESPSVYSES FVDNIRLIAF YLPQFHPIPE NDQWWGKGFT EWTNVTKAKP 
QFPGHYQPHL PADLGFYDLR LREARQAQAD LAREYGIYGF CYYHYWFNGQ RILERPFNEV
LQSGEPNFPF CLCWANESWT RRWDGQEQEI LMKQVYTEQD DQQHIRYLAE AFQDPRYIRV
KGKPLFLVYR ANQLPNPLKT TEIWREEAQK LGVGEIFLAR VESFLDEHND PRKIGFDAAV
EFQPDWGKLG KKLQSRKRWE IARKYGLAHQ SYGIHNIFDY QTMVTRMLSK PIVNYPRFPG
VTPSWDNTAR RQVAATILKD STPEIYEYWL KAVIEKTISK PELPPIIFIN AWNEWAEGNH
LEPCQRWGRS YLEATQRAIK QFS
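
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which start codons are permitted, which this sketch does not model). The table is built from the conventional TCAG codon ordering, and `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

# Codons enumerated in TCAG order (first base varies slowest), paired with
# the amino acids of the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last display lines of the gene sequence.
print(translate("ATGACCTTAA ACTCAGAACC TATTAATTCT TCAGAATCCC CATCTGTTTA TTCCGAATCT"))
# MTLNSEPINSSESPSVYSES
print(translate("CAATTTTCGT AG"))
# QFS
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed above. The last display line can be translated on its own because the 19 preceding lines hold 60 bases each, which keeps it in frame.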