Gene PCC8801_4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4271 
Symbol 
ID7103817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4484917 
End bp4486185 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content39% 
IMG OID643477251 
Productglycosyl transferase group 1 
Protein accessionYP_002374350 
Protein GI218248979 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGG AAAAAATTAT GCGTATTGGC TATGTCGTAA AACGCTATCC CCGTTATTCA 
GAAACCTTTG TGGTTAATGA AATTTTAGCC CACGAAAATG CGGGTTTAAC TATCAACATT
TTTGCCCTAC GTCCTCCTTG TGATACCCAT TTTCAAAATT GTATCTCACA AGTTCGTGCC
CCTGTTACCT ATATCCGTAA ACCTGTTGAA GGACGGATGA GTGAGTCCCT TAATAGTACT
TCTCCGACGG CTGCTAGTTA TTTTTGGGCA GAATTACAAG AATTAGGACA GGTAATTCCT
GATTTTTGGC AAAAACTAGC AATTGCCCAA GGAGAACGCG CCAGTACTGT CTATCAAGCT
GCTTGGTTAG CCAGAGAAGT CCGGTTACGG GGAATTAGCC ATCTACACGC CCATTTTGCC
TCAGTTGCTA CCAGTGTCAC CCGTTTAGCG GCACATTTTG CGGGGGTTCC CTACAGTTTT
ACTGCCCATG CTAAGGATAT TTTCCATGAA AGTGTTGATT TTGACGATAT GACGCGAAAA
TTGCGAGATG CGTCTAGAGT GGTGACAGTG AGTGACTATA ATAAACAATA TCTACAACAA
ACCTATGGAA AAGTCGCCCA AAATGTTGAA AGAATTTATA ATGGTTTAAA TTTATCTGAA
CTCAATTATC AATCTCCTGA AAATCGTCCT TCTCGCATCA TTTCTGTGGG GCGTTTAGTC
GAGAAAAAAG GGCTTTCAGT TTTAATCAAT GCCTGTGCCT TATTAAAACA ATGGGGCTGT
CACTTTCAGT GTCAAATTGT TGGAAATGGC AACTTAGAAA CTGCATTAAA TCAACAAATT
GAAGCCTTAA AATTGCAATC TTTTGTAAAG ATAATGGGAC CTCGACCTCA AAATGAAGTT
TTTGAGTTAA TCCAAGAAAG TGCCGTATTT GCTGCCCCTT ACTTAATTGG AAAAGATGGC
AATAGAGATG GTTTACCCAC CGTATTATTA GAAGCAATGG CGTTAGGAAC CCCTTGTGTA
GCAACAGATG TTACGGGTAT TCCTGAAATG ATTAGACATC AACAAACAGG GTTAATAGTT
CCTCAAAATA ATGCTGAAGA CTTGGCTATT GCGTTGCGAA CCTTATTAAC TGATAAGACT
CTCAGAGTTC AATTATCAAG TAATGCAAGG AAGTTGATGG AATCTGAGTT TAATATAACT
CATAATTCTG CTGCCTTACG AGAGGTATTT ATCTCTTCAA ATCATCAACT ATTAGCCGTC
AATTCCTAA
 
Protein sequence
MSEEKIMRIG YVVKRYPRYS ETFVVNEILA HENAGLTINI FALRPPCDTH FQNCISQVRA 
PVTYIRKPVE GRMSESLNST SPTAASYFWA ELQELGQVIP DFWQKLAIAQ GERASTVYQA
AWLAREVRLR GISHLHAHFA SVATSVTRLA AHFAGVPYSF TAHAKDIFHE SVDFDDMTRK
LRDASRVVTV SDYNKQYLQQ TYGKVAQNVE RIYNGLNLSE LNYQSPENRP SRIISVGRLV
EKKGLSVLIN ACALLKQWGC HFQCQIVGNG NLETALNQQI EALKLQSFVK IMGPRPQNEV
FELIQESAVF AAPYLIGKDG NRDGLPTVLL EAMALGTPCV ATDVTGIPEM IRHQQTGLIV
PQNNAEDLAI ALRTLLTDKT LRVQLSSNAR KLMESEFNIT HNSAALREVF ISSNHQLLAV
NS