Gene PCC8801_4242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4242 
Symbol 
ID7103795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4451796 
End bp4452989 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content35% 
IMG OID643477223 
Producthopene-associated glycosyltransferase HpnB 
Protein accessionYP_002374322 
Protein GI218248951 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACA TACTATTATT AACGACAATT TTATCATTAA TCATTTGGGT TTATTTATTA 
CTCTTCAGAG GAGGTTTTTG GCTATCAAAT CAAAAGATCA AACCCCAAGC ACTAGGAATA
ACTGATTATC CTTCTGTTTA TGCTGTTATC CCTGCACGCA ATGAAGCTGA TGTTTTACCT
ATTAGTTTAA AATCCCTATT AAACCAAGAT TATTTAGGTC AATTTACTAT TATTTTAATC
GATGATCAAA GTAGTGATGG AACAGGAGAA GTTGCTCAAG AAATTGCTAA AAACTGTCAT
CAATCTAACC GTTTAATTGT TATTTCAGGA CAGACATTAC CCACTGGATG GTCAGGAAAA
TTATGGGCAA TGGAGCAGGG ACTTAAATAC ATAAAAAAGC ATAATTGTCA ACCAAAATAT
ATACTTTTTA CCGATGCTGA TATTGAACAT CATCCAACTA ATTTACAGGA ATTAGTAACA
AAATCTCAGC AAGAAAATTT AGCCTTAACT TCCTTGATGG TGTGGTTAAG ATGTCAAAGT
ATTTGGGAAC AATTTTTAAT TCCTGCGTTT GTCTTTTTCT TTGAGAAACT CTATCCTTTT
GCTTGGGTTA ACAACGCTAA AAATAAAATG GCTGCTGCTG CGGGAGGATG TATCCTCATT
CGTCGGGATA TCCTCGAAGA AATTGGAGGA TTAGAGATAG TCCGTCAAGC ATTAATTGAT
GATTGTTCCT TAGCTGCTGC GGTGAAATCT AAATTACAAC AGAACCCAAA CAATACCCAA
GGAATTTGGT TAGGATTAAG TGAAAAAACC CGTAGTTTAC GGCCTTATGA TTCCTTAGAA
ACGATTTGGA ATATGGTAGC CAGAACTGCC TATACGCAAC TCAATTATTC CCCTTTATTA
CTAAGTGGAA CAGTTTTAGG ATTAACCCTA GTTTATCTAA TTCCTATCTT GAGTTTAGCG
TTAGGATTAC TCCTAGGAAA TAGCTTAATT GCTCTTTTTG GGGGGATAAC TTGGATACTA
ATGGCTATTG CCTATTTACC TACTTTAATC CTTTATAAAG CCTCACCCTT ATGGTCGTTA
ACCTTACCAA TTATTGCCTT TTTATACTTA TTAATGACTA TAGATTCTGC GCTGCGTCAT
TGGCAAGGAA AAGGAGGTGC TTGGAAGGGA AGAGTTTATG CCAATAATGA ATAA
 
Protein sequence
MENILLLTTI LSLIIWVYLL LFRGGFWLSN QKIKPQALGI TDYPSVYAVI PARNEADVLP 
ISLKSLLNQD YLGQFTIILI DDQSSDGTGE VAQEIAKNCH QSNRLIVISG QTLPTGWSGK
LWAMEQGLKY IKKHNCQPKY ILFTDADIEH HPTNLQELVT KSQQENLALT SLMVWLRCQS
IWEQFLIPAF VFFFEKLYPF AWVNNAKNKM AAAAGGCILI RRDILEEIGG LEIVRQALID
DCSLAAAVKS KLQQNPNNTQ GIWLGLSEKT RSLRPYDSLE TIWNMVARTA YTQLNYSPLL
LSGTVLGLTL VYLIPILSLA LGLLLGNSLI ALFGGITWIL MAIAYLPTLI LYKASPLWSL
TLPIIAFLYL LMTIDSALRH WQGKGGAWKG RVYANNE