Gene PCC8801_0051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0051 
Symbol 
ID7103716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp53891 
End bp54991 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content34% 
IMG OID643473167 
Productglycosyl transferase group 1 
Protein accessionYP_002370314 
Protein GI218244943 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTT TATACGACGG TGAAATCTAC TCTAATCAAG TTGCAGGCGG AATTAATCGT 
TATTTTGCCA ATATTATTAG TCGGCTTCCC TCTGATTTTA CTCCCTCATT AATAGTAGAA
AGCTCCCCTG AACTAAACTA TCCTGTTCAT CCTAATTTAA AAAGTTTGTG GTGGTATAAA
AGATTTCGTC CAGAACGCCT TCGTATTTTG ACTGATAAAT TATATTCTAA TGCTATCAAT
AAGTTTAATC ATTTTGACCT TGCTCATCCC ACTTATTATT CATTAGTGAC TCGTCAACCT
CTAGATAATT ATAAGTGTCC TATTGTGATA ACGGTTTATG ATATGATTCA TGAACTTTTA
CCTCAACAGG TTCCCTATAG TAGTCATGGA ATTTCAATTA AAAGTAAAGC AATTAAATCA
GCACAAGCTA TCATTTGTAT TTCAGAAAAT ACTAAAAAAG ATTTAGTAAA TTTGTATTCC
ATTCCAGAAC ATAAAATATC CGTAACCTAT TTAGCAGCAG AAATTGATGT TAGTCTATCT
TATGGGTCTG AAGTGGTGCC AAAAGATCCT TATTATCTGT ATATTGGTAG TCGAGCTAAA
TATAAGAATT TTGACCGTTT ATTACTAGCT TTTGCAAAAA CTATTTCAGC GCAATCTGAT
CTGAAATTGT GTGTTATAGG TTCACCTTTT AATGAGAAAG AAGCAAAAAG AATTGCTGAA
CTAAAGTTAG GTGATCATCT AGAAAATTAT GGATATGTCA GTGACTCTCA TCTTGCTAAA
CTTTATCGTA ATAGTATGGC TCTTGTTTAT CCTTCCCTAT ACGAAGGTTT TGGTATTCCT
CCTCTTGAAG CAATGTCCTG TCAAACGGCT GTAATTGCTG CCAACTCATC GAGTCTTCCT
GAAGTTGTAG ATGATGCTGG TTTGCTATTT AATCCTGAGT CTACTGATGA ATTAGCAGAA
CAATTAATCT TTTTGCTTAA TCATCCTATA GAACGGGAAA ATTTAATTAC AAAAGGTTAT
GCAAGAAGCA AGTTATTTAC TTGGGAAAAA ACTGTAGCTG AAACCATTGA TGTTTATCGT
TCCCTCACTG AATCAAGGTA G
 
Protein sequence
MKILYDGEIY SNQVAGGINR YFANIISRLP SDFTPSLIVE SSPELNYPVH PNLKSLWWYK 
RFRPERLRIL TDKLYSNAIN KFNHFDLAHP TYYSLVTRQP LDNYKCPIVI TVYDMIHELL
PQQVPYSSHG ISIKSKAIKS AQAIICISEN TKKDLVNLYS IPEHKISVTY LAAEIDVSLS
YGSEVVPKDP YYLYIGSRAK YKNFDRLLLA FAKTISAQSD LKLCVIGSPF NEKEAKRIAE
LKLGDHLENY GYVSDSHLAK LYRNSMALVY PSLYEGFGIP PLEAMSCQTA VIAANSSSLP
EVVDDAGLLF NPESTDELAE QLIFLLNHPI ERENLITKGY ARSKLFTWEK TVAETIDVYR
SLTESR