Gene PCC8801_0746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0746 
Symbol 
ID7102799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp772368 
End bp773522 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content39% 
IMG OID643473844 
Productglycosyl transferase family 2 
Protein accessionYP_002370986 
Protein GI218245615 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAATA TCTTACAAAC CCTGTGTTTA GTCCCCATTT TAGGGGGTTC CATCTTCTCA 
ATATTAACCG TGTGGACAGC GCAACGCTTC CTCAAAAAAT CTAACCGAAC AGTCCTTAAT
GGGTTTACTC CCCCCGTCAC CGTTCTCAAA CCAGTACGCG GGTTAGAAAA AGATCTAAAA
CTAAATCTGC GAACAATTGC CACCCAAAAC TATCCCGAAT ACCAGGTTAT TTACTCTGTT
CAAGATCCCA ATGATCCCGC TTTTCCTATT ATTAAAGAAA TTCAAGAGGA ATTTGGTAAA
GAGAGAATTT CCGTGGTAAT TAGTACCGTT GAAGCGGGGG CAAATGGTAA GGTAAATAAC
CTATTAGGAG CCCTTAAAGA AGCTCGTCAT GATATTATTA TTATTAGCGA TAGTGATACC
CATTTACGAC CTGATTATCT CAGCAATATT GTTGCACCTT TAGCTAATTC TGAGGTAGGT
TGTGTCTGTA CTTTATTTAA AGTAACACGA GGCGATCGCT GGTTTGAAAA AATGGAATTA
TTAACCATGA ATGCGGACTT TATGCCAAGT GTTATCTTTG CAGAAGTAAC CGGAACATCT
AAAGCTTGTT TAGGTCCTTC TATTGCTATT CGACGTTCAA CCTTAGACGA ATTAGGGGGG
TTAGAAAGTT TAGCAGATTA TCTAGTAGAA GACTACGAAA TAGGACGACG AGTGTGGACT
TCTGGTAAAA AAATGGTACT TGTCCCCTAC ATTATTGATG CTGTTGTAGA CTTAAAAAAT
TGGCAAAATT GGTGGACACA TCAAGTCTAT TGGGATCAGA ATACTTACTT AGCTAAACCG
ACTCCTTTTA TCGCTACTAT CCTGATTCGT GCCATTCCCT TTGCGCTATT GTTTGCCTTA
ATTCGGGGTG ATATAATTGG GCTATCTGTT TTGGTATCAG CTATTATTAT TCGTTTAATT
ACCGCAGCGA TGACTGCTTG GGAAATGAAA GATTTTGAAA CCATTAAAAG TCTCTATTTA
TTACCTTTTC GGGATTTAAT TGGCTTAGTC TTTTGGGGGT TATCTTTTAC GCAAAGAACC
GTTGTTTGGC GTGGTGTTGA ATTTAAACTA ACCAGTCATG GTAAAATGGT TATGCGTCGG
TCTTTACCCA GTTAA
 
Protein sequence
MLNILQTLCL VPILGGSIFS ILTVWTAQRF LKKSNRTVLN GFTPPVTVLK PVRGLEKDLK 
LNLRTIATQN YPEYQVIYSV QDPNDPAFPI IKEIQEEFGK ERISVVISTV EAGANGKVNN
LLGALKEARH DIIIISDSDT HLRPDYLSNI VAPLANSEVG CVCTLFKVTR GDRWFEKMEL
LTMNADFMPS VIFAEVTGTS KACLGPSIAI RRSTLDELGG LESLADYLVE DYEIGRRVWT
SGKKMVLVPY IIDAVVDLKN WQNWWTHQVY WDQNTYLAKP TPFIATILIR AIPFALLFAL
IRGDIIGLSV LVSAIIIRLI TAAMTAWEMK DFETIKSLYL LPFRDLIGLV FWGLSFTQRT
VVWRGVEFKL TSHGKMVMRR SLPS