Gene PCC8801_3566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3566 
Symbol 
ID7105791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3716261 
End bp3717484 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content37% 
IMG OID643476576 
Productglycosyl transferase group 1 
Protein accessionYP_002373685 
Protein GI218248314 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTG CTTACTTAAT TAACCAATAT CCTAAAATTA GTCATAGTTT TATTCGACGA 
GAAATTCTCG CTTTAGAAGA GTTAGGGTTG CCGATTACGC GATTTTCTAT TCGTTCGTGT
GCAGAACCCC TAATTGATGA AGCGGATCAA CAAGAATTAG CCAAAACCAA CATTATTTTA
GATGCGGGAA TCTTGGGGTT ATTAATCAGT CTGCTGAAAG TGGCTATCAC TAGACCTCAA
CGTTGGATAG AGGCTTTTAT ATTAACCTTG AAATTGGGTT GGAAGTCTGA TAGAGGAATT
TTGTTATACT GTGCTTATTT GGCGGAAGCT TGTGTTTTAA TTGACCATTT TTCTGAACTA
CAGATATCTC ATTTTCATGC TCATTTTGGG ACTAATTCTA CCATGGTGGT TTTACTCAAT
CATATTTTAG GAGGGGCTTC TTATAGTTTT ACTTTACATG GTCCTAAAGA ATTTGAAAAA
GTAGAAGCGA TCGCTTTACC AGAAAAAATT AAACAAGCTG AGTTTGTTGT GGGGATTAGT
TCCTATGGTC GCAGTCAACT TTGTCGCTGG TGTGACTATA CTAAATGGGA CAAAATCAAG
GTTATTCATT GTGGTCTTGA TCAGTCTTTT TTTTCCTTGC CTCGCCAACC TATTCCTCAA
GAAAATACAT TAGTTTGTGT TGGAAGATTA TGTGAGCAAA AAGGACAATT ATTATTAATT
GAAGCAGCCA GTAAATTAGT GGCGCAAGGC TTTAAATTTA AGTTGATTTT AGTCGGAGAT
GGACCTTTAA GAGAACCCAT TGAACAAGCG ATCGCTCGTT GGCAATTAAA AGAGACGGTT
GAGATTACTG GATGGGCAAC TCAAGCAGAA GTTCAACAAC ATATTTTAGC CTCAAAAGCG
ATGGTTTTAC CGAGTTTTGC CGAAGGACTC CCAGTGGTAT TAATGGAAAG TTTAGCCCTT
GGTCGTCCTG TTATTAGTAC CTATGTTGCA GGGATTCCTG AATTAGTTAT TCCTGGTAAG
TCAGGATGGT TAGTTCCCGC AGGATCAGTT AATCCTTTAG TTGATGCTAT GAAAAAGGTT
TTAGAAACAC CAATTTCTCA ATTAGAGGAT ATGGGAAAAA CTGGAGCTAA CTATGTTAAA
GAACATCATA ATGTTTTGAC TGAAGCCCAA AAACTCATGT TATTATTTCA AGAAGCAAAA
CATTCAAATA AAAGTCCAGT TTGA
 
Protein sequence
MKVAYLINQY PKISHSFIRR EILALEELGL PITRFSIRSC AEPLIDEADQ QELAKTNIIL 
DAGILGLLIS LLKVAITRPQ RWIEAFILTL KLGWKSDRGI LLYCAYLAEA CVLIDHFSEL
QISHFHAHFG TNSTMVVLLN HILGGASYSF TLHGPKEFEK VEAIALPEKI KQAEFVVGIS
SYGRSQLCRW CDYTKWDKIK VIHCGLDQSF FSLPRQPIPQ ENTLVCVGRL CEQKGQLLLI
EAASKLVAQG FKFKLILVGD GPLREPIEQA IARWQLKETV EITGWATQAE VQQHILASKA
MVLPSFAEGL PVVLMESLAL GRPVISTYVA GIPELVIPGK SGWLVPAGSV NPLVDAMKKV
LETPISQLED MGKTGANYVK EHHNVLTEAQ KLMLLFQEAK HSNKSPV