Gene PCC8801_3367 details

Gene Information

Locus tag: PCC8801_3367
Symbol:
ID: 7103021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3516452
End bp: 3517894
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 40%
IMG OID: 643476382
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_002373491
Protein GI: 218248120
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
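
The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that reading) and that the coding sequence ends with one stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3516452
end_bp = 3517894

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop
# codon, which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1443 0 480
```

The results match the Gene Length (1443 bp) and Protein Length (480 aa) fields.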


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATC TTCCCTCTCA CACCCAGCAA TTACTCCCTG ATATTCGGGC TCCTCATCCC 
TTGAATTGGT TAACATTAGC TAATCAACAA AGGTATCGTA TTCTTATTTT AATTTTAAGT
GATTTGATTG CCTTAGCTGG AGCTTGGTTA ATTGCCCGTT ATTGGAATCA ATTTTATTCT
CCGATTCCCC CTGAATTAGA CTGGTGGAAT TGGTTAGGAC TGCCCAGTTT ATTCTGGATT
TTTGGAGCCG TTACCCTCAT CTTTTTTGGA TATGGAGGGT TGTATAGTGC CTCCCTTCGC
AACCAGAATT ACATCCGTTC TGCTAAGATT ATTAGCCTAG TTTATTTATT ATCTTTAGTC
GTTAGTTATT TCTATGATCC TAAACTTGAT CCACCGCGAT CGCTCTTTTT TACCGCGTGG
GTAAGTAGTG TTGTCATGGT GTTAGGATTT CGCTTATTAA TTACCCTAAT TTTCGGACAA
ATCTACACTA AACAAAAAGA AGTTTCTGTG TTTGTTATTG CCAGCGCGTC TCGCCTCAAA
AAACTCTCTC AGATATTACA AAAACGCTCT TGTTATAAAA TTGTTGGGGC TGCCTTAGCT
TCCACTGCCA ATAGTCCCGC TACATTACGC GCAATTTTGC AATCAGAGGC GGTAGAAGTG
TTAGCGGAAG ATTTACCCCA AACAGCCCTA GCGTCGACCT TATATTGGAA TTTACGCCGT
GCTGGTGTGG CATTACGCTT ATTACCCTCA AGTCGAGAAA TTCTCTACCG TCGGGGAGTG
CCTGAAATCT TTGCGGGTCT GCCAACATTA CGAGTACAAA CCTCTTCCAT GGTAGGGTGG
GATTATCGTG TTAAACGGGG GTTAGATTGT TTGGGAGCTT TAATCGGAAT TATTCTGTTA
TCTCCTTTGT TTTTAGGGGT AGCTATTTTG ATTCAATTGT CCTCTCCTGG TTCCGTCTTT
TTCTGTCAAG AAAGAATTGG ATTACACGGC AAAGTTTTTC AAATGTGGAA ATTCCGAACT
ATGGTTCCTA ATGCGGCTCA ATTACAAGCA CAATTAGAAG CCCAAAATGA GTCAGGTGAT
GGGGTATTAT TTAAAATAAA AAATGATCCG CGTATTATTC GCTTTGGTCA TTTCTTGCGA
AAAAGTAGTA TTGATGAATT GCCTCAACTG TTTAATGTTT TAATCGGTCA AATGAGTTTA
GTTGGCCCCC GTCCTTTGCC TTTACGCGAT GTAGAACGCT TTGAAGAATG GCATCATATT
CGTCATCAAG TCTTACCAGG AATTACAGGA CTGTGGCAAA TTTCGGGACG ATCAGATATT
GAAGATTTTA GTGATGCTGC CCGCTTAGAT CTATACTATA TTGATAATTG GTCATTGAAT
TTGGATTTAG ATATCTTAGT TGAAACCGTC AGAATTGTTT TGTTTGGCAA AGGAGCCTAT
TAA
 
Protein sequence
MKNLPSHTQQ LLPDIRAPHP LNWLTLANQQ RYRILILILS DLIALAGAWL IARYWNQFYS 
PIPPELDWWN WLGLPSLFWI FGAVTLIFFG YGGLYSASLR NQNYIRSAKI ISLVYLLSLV
VSYFYDPKLD PPRSLFFTAW VSSVVMVLGF RLLITLIFGQ IYTKQKEVSV FVIASASRLK
KLSQILQKRS CYKIVGAALA STANSPATLR AILQSEAVEV LAEDLPQTAL ASTLYWNLRR
AGVALRLLPS SREILYRRGV PEIFAGLPTL RVQTSSMVGW DYRVKRGLDC LGALIGIILL
SPLFLGVAIL IQLSSPGSVF FCQERIGLHG KVFQMWKFRT MVPNAAQLQA QLEAQNESGD
GVLFKIKNDP RIIRFGHFLR KSSIDELPQL FNVLIGQMSL VGPRPLPLRD VERFEEWHHI
RHQVLPGITG LWQISGRSDI EDFSDAARLD LYYIDNWSLN LDLDILVETV RIVLFGKGAY
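
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that with the Python standard library only. The function names (`clean`, `translate`, `gc_percent`) are hypothetical helpers, not part of any database tool. The codon table is built from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; alternative start codons are not handled, which does not matter here because the gene starts with ATG. Only the first 60-base line of the gene sequence is used as input.

```python
# Amino-acid assignments for NCBI translation table 11 (identical to the
# standard code), listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block):
    """Remove the spaces and line breaks used in the 10-base display layout."""
    return "".join(seq_block.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAAATC TTCCCTCTCA CACCCAGCAA TTACTCCCTG ATATTCGGGC TCCTCATCCC"
print(translate(clean(first_line)))  # MKNLPSHTQQLLPDIRAPHP
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.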