Gene PCC8801_1736 details

Gene Information

Locus tag: PCC8801_1736
Symbol:
ID: 7101809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1821396
End bp: 1822574
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 37%
IMG OID: 643474803
Product: glycosyl transferase family 2
Protein accession: YP_002371939
Protein GI: 218246568
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
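
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1821396
END_BP = 1822574
STATED_GENE_LENGTH = 1179      # bp
STATED_PROTEIN_LENGTH = 392    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, length == STATED_GENE_LENGTH)               # 1179 True
aa = protein_length(length)
print(aa, aa == STATED_PROTEIN_LENGTH)                    # 392 True
```

Under these assumptions the stated lengths are consistent with the coordinates: 1179 bp is 393 codons, and dropping the stop codon leaves 392 residues.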


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGTTA AGAATGAAGC CGAAAATTTA CCACCGTGTT TAGAAAGCGT TAGAAATGTT 
GTTGATGAAA TGGTGGTAAT GGATACAGGA TCAACGGATC AAACGGTAGA AATTGCTCAA
CAATTTGGTG CAAAAGTTCC TTACTTTGAA TGGTGTAATG ATTTTGCGAT CGCTCGTAAT
GCGGCTCTTG ATCATGTCAC AGGAGACTGG GTATTAATCT TAGATGCGGA TGAGAGATTA
AACCCCAATG TTGTCCCTCA ACTCAAACAA GCCATCACCG ATGAAAATAG TTTAGTCATC
AATTTAGTGC GTCATGAAAT TGGCGCATCT CAGTCTCCTT ATTCTTTAGT TTCGCGGTTA
TTTCGGAAAC ATCCAGAGGT TGAGTTTTCC CGTCCCTATC ATGCCATTAT TGATGATAGT
GTTAGTGAAT TGTTGAAAAA AGAAAGCCAT TGGAAAATTG TTGATTTACC CGCGATCGCA
GTTTTCCATT ATGGTTATGA TCCCCAAACC ATTACCGCTT TGGATAAATA TACCAAAGCG
CAAAAATCAA TGGAGGGATT TTTGGACAAA AATCCCAATG ATCCCTACAC TTGTAGTAAG
TTAGGGGCAC TATACTTACA AATTGGCAAG GAAAAAGACG GCATTAAATT ACTCAAAAAA
GGATTAAAAT CCAATAAAGC TGATGCTCAT GTTTTATTTG AATTACATTA TCATCTAGCT
AATGCTTATA CCCGTGAAAA TGAATCAGAA AAGGCTATTA AGCACTACCA AAAAGCCATT
GTTCAAGAAA TCATGGCTCC CTTAAAATTA GGTGCTTATA ATAATTTTGG AGTAGTATTA
CAAAGCATCG ATGACTTTAA AAATGCTGCT AAAATGTACG AAACAACCCT ACAAATTGAT
CCCAATTTTA TTACAGGCTA TTATAATTTA GCCATGACCT TGAGTAGCAT GGGACGCTTA
GCAGATGCAG AAGCGGTTTA TAATAAATTG CTCTCTCTAA GTCCTAATTA TGCACCAGCC
TATCAAAATT TAGGCGTTGT CTTATTTAAG TTAAAGAAAT TACCTGAAAG TTCAGCCGCG
TTTAAAAAAG CCATGAGTCT TTATGAATCG CAAAATTATC ATCAAGAAGC GCAAAAACTC
AAAGCTGGAC TACAAGAATT AGGCATTTGG GAAGAGTAA
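
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction helper. The function names are hypothetical, and only the first line of the sequence is used here as sample input, so the printed GC value describes that line alone, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATTGTTA AGAATGAAGC CGAAAATTTA CCACCGTGTT TAGAAAGCGT TAGAAATGTT"

seq = clean_sequence(first_line)
print(len(seq))                      # number of bases in this line
print(round(gc_fraction(seq), 2))    # GC fraction of this line only
```

Passing the full multi-line sequence to `clean_sequence` in the same way would allow the result to be compared with the Gene Length and GC content fields of the record.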
 
Protein sequence
MIVKNEAENL PPCLESVRNV VDEMVVMDTG STDQTVEIAQ QFGAKVPYFE WCNDFAIARN 
AALDHVTGDW VLILDADERL NPNVVPQLKQ AITDENSLVI NLVRHEIGAS QSPYSLVSRL
FRKHPEVEFS RPYHAIIDDS VSELLKKESH WKIVDLPAIA VFHYGYDPQT ITALDKYTKA
QKSMEGFLDK NPNDPYTCSK LGALYLQIGK EKDGIKLLKK GLKSNKADAH VLFELHYHLA
NAYTRENESE KAIKHYQKAI VQEIMAPLKL GAYNNFGVVL QSIDDFKNAA KMYETTLQID
PNFITGYYNL AMTLSSMGRL ADAEAVYNKL LSLSPNYAPA YQNLGVVLFK LKKLPESSAA
FKKAMSLYES QNYHQEAQKL KAGLQELGIW EE
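
As a sanity check, the protein sequence can be compared with a translation of the gene sequence. The sketch below builds a codon table by hand rather than relying on a library. It assumes the standard codon-to-residue assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The `translate` function is a hypothetical helper, not part of the source database.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGATTGTTA AGAATGAAGC CGAAAATTTA"))  # → MIVKNEAENL
print(translate("GAAGAGTAA"))                          # → EE
```

Both fragments agree with the start (MIVKNEAENL) and end (EE) of the protein sequence listed above.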