Gene PCC8801_3387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3387 
Symbol 
ID7103091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3533702 
End bp3534976 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content37% 
IMG OID643476402 
Productglycosyl transferase group 1 
Protein accessionYP_002373511 
Protein GI218248140 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATCC TAATTTATTC TTACAACTAT CATCCTGAAC CCATCGGTAT TGCTCCTCTG 
ATGACAGAAT TAGCAGAGGG ATTAGTCAAA CGCGGACATC AAGTGCGGGT AGTAACGGCA
ATGCCTTGGT ATCCTTCAAG TGAAATTTCT GCTGAGTATC GCGGAAAATT GTATCTAACA
GAAGACCGTA ACGGGGTTAA AATTCAACGA TGCTATGTTT GGATTCGACG CAAACGCAAT
TTTAAAAATC GTGTTTTATT TGAATTAAGC TTTGTTTTTC TGAGTTTTCT ACAAGCGTTA
CAAGGATGGC GACCGGATGT TATTTTTTTG ACAATTCCTG GTTTACCCGT TTGTGTTCCA
GCAGCTATTT TAGCTCGGTT ATATCGTATT CCTATTCTTT TAAATCTTCA AGATATTCTG
CCTGATGCTG CTATCCATGT GGGTTTAATC ACTAATCAAA AAATGATTAA AGTGTTTCAA
TGGTTAGAAG CATTTGCTTA TAAAACGGCA ACTAAAATTA GTGTTATTGC TGATGGATTT
ACCAAAAACT TAATCAGTAA AGGAGTTCCA TCGGATAAAA TTGTTGAGGT TCCTAACTGG
GTTGATGTCA ACTTTATTAA ACCTTTACCT CAAGAGAATA ATTACTTTCG CCAAGAGAAT
AATTTGGCAA ATAAATTCGT TATTCTATAC TCTGGTAATA TTGCCTTAAC TCAACCGTTA
GAAACCTTAA TTGATGCAGC AGCATTAGTC GGATATATTC CAGAAATTGC TATCGTGATT
GTAGGGAAAA AAGAGGCTCT AGAAAGGCTA GAAATATATC GGCAAAGAAA ACAAGCCAAT
AATGTCATTT TAAGACCTTT TCAGCCGAGA GAAAAATTAC CCGAAATGTT AGCAGCGGCC
GATGTGGGAA TGGTGATGCA AAAAGGTAAT GTAATTGCCT TTAATATGCC CTCAAAAATT
CAAGTTTTGT TAGCCAGTGG TCGAGCGATT ATTGCTTCTG TCCCAGCCGC AGGAACAGCA
GCTAGAGCCA TTAAAAAAAG TGGGGGAGGA ATTGTGGTTC CTCCCGAAGA TCCTCAAGCC
ATTGCTAAGG CTATTGTAGA CTTTTATTCT AATCCAGATT TAGTGGCTCG TTTAGGGCAA
CAAGGAAGAG AATATGCTAT CCAAAATTAT GCTTTTGATT CAACGTTAGA TCAATATGAA
AACTTATTAC AGTCAGTGGT TAAACCCCTT AAAAGTAAGG AGGAATTGCA TAAAATAATC
AACAATAAGG AATAG
 
Protein sequence
MRILIYSYNY HPEPIGIAPL MTELAEGLVK RGHQVRVVTA MPWYPSSEIS AEYRGKLYLT 
EDRNGVKIQR CYVWIRRKRN FKNRVLFELS FVFLSFLQAL QGWRPDVIFL TIPGLPVCVP
AAILARLYRI PILLNLQDIL PDAAIHVGLI TNQKMIKVFQ WLEAFAYKTA TKISVIADGF
TKNLISKGVP SDKIVEVPNW VDVNFIKPLP QENNYFRQEN NLANKFVILY SGNIALTQPL
ETLIDAAALV GYIPEIAIVI VGKKEALERL EIYRQRKQAN NVILRPFQPR EKLPEMLAAA
DVGMVMQKGN VIAFNMPSKI QVLLASGRAI IASVPAAGTA ARAIKKSGGG IVVPPEDPQA
IAKAIVDFYS NPDLVARLGQ QGREYAIQNY AFDSTLDQYE NLLQSVVKPL KSKEELHKII
NNKE