Gene PCC8801_3228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3228 
Symbol 
ID7105508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3371005 
End bp3372399 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content43% 
IMG OID643476249 
Productglycosyl transferase family 39 
Protein accessionYP_002373359 
Protein GI218247988 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATC TGACCTTCGC TTGGAACCGC TCACAAAATC AACGACATCA CGGCCAATAT 
TGGCAGGAAT TCCTGTCGGG GATTGGGCTA TTTTTGGCAG CCTTGCTGCT ATTTGGTCTT
AACCTTGACT CCCCTTTACT CAATGGCCAA GAAACAATCA TTGCTACGAT GGCTAAGGAA
ATGTCAGAAC AGTCTTTCAA TTTTGGCCGA TGGCTGTTTC CGACTGTCTG GGGAGACTCT
TTTTATCCCT ATCCTCCTCT GGGAATTGGT CTAACCACGA TCGCTTACTG GTGGGGGGGT
ATGGATGAAC AGACGACCCG TCTACCGGGT GCTATTTTAA CAGCTCTTTG TATTCCCCTT
TTCTATTTCC TGGGACGAGA AATTTTTGCG CGGTGGATGC CGGCCTTGTT TTCGGCACTG
ATTTTGTTAA CCCTTTTCCC TGTGGTTATT CAGGGACGGT TGGCTTTATT AGATGGGATC
ATGCTTTGTT TTCAACTCTT AACGATGTTT TGTTTATTGC GATCGCGTCG GGATCTTCGT
TGGTCTCTTG GAGTCGGTCT AAGTTTGAGT TGTCTGTTCT TAACCCATGG CATTATCGGT
CTTTTCCTGA GTTTCGTGGT CTTGATTTTT ATCGTGTGGG ACACCCCAAG ACTGTTAACG
TCGGCTTATT TTGGACTAGG GATGGTTTTA GGTTTGCTCC CGGGTATCGG TTGGTATACT
CAACAAGGAA TGCTCTACGG CCAGTCATTT TTTAGGGCGA TGTTTATTAC ACCTTTGGCT
GAAAATCACG GCTTGTTTCG GGGATTATTG GGAATTCATG CAGTGGAATT ACTCAAGTCG
TCTTTGCCTT GGTTAATCTT TGCGGTTTAT GGCGGGTTTT TGGCAACAAA ATATCTAGTT
TGGGGTTGGT CTAAGTTAAT TTTGGTATGG GGTGGGGTCT ATTTGATTGT TTTTAGCTTA
CTTCCTTTAC CCACAGTTAG TTATCTAATG CCATTTTATC CTCCTTTGGC CTTAGCGGGT
GGCATGGCTT TGGCTGAGGT GTATCATTGG CCGATCAATC GTCCCTATCC TAAGTTATGG
TGGATCATTT TATTAGGAAT GTCTGGGGTG ATTAGCCTAG TCAGTTTAAG TTTTTTGCTG
AATTTTCCCT GGGACTTTAG TTACCTTTCC TATCGATTTT TATTGATTGT TACGTTAGGC
TCGATCGCGT TTACTTTTTT GACAACTGCT GTTTTAATTG CTCAGCGTGA TTCTCAATTT
ATTGTCATTT TAGTCTGGGG AATGTATATT TCTTTATTAT TGTGGGTTAA TTCTCCCTAT
TGGACTGGAG AAATAAAAGC CAATTCTTCG ATAATTAATA CAATCGCCTT AAATGAATTT
TCAGGAATAT CCTGA
 
Protein sequence
MNDLTFAWNR SQNQRHHGQY WQEFLSGIGL FLAALLLFGL NLDSPLLNGQ ETIIATMAKE 
MSEQSFNFGR WLFPTVWGDS FYPYPPLGIG LTTIAYWWGG MDEQTTRLPG AILTALCIPL
FYFLGREIFA RWMPALFSAL ILLTLFPVVI QGRLALLDGI MLCFQLLTMF CLLRSRRDLR
WSLGVGLSLS CLFLTHGIIG LFLSFVVLIF IVWDTPRLLT SAYFGLGMVL GLLPGIGWYT
QQGMLYGQSF FRAMFITPLA ENHGLFRGLL GIHAVELLKS SLPWLIFAVY GGFLATKYLV
WGWSKLILVW GGVYLIVFSL LPLPTVSYLM PFYPPLALAG GMALAEVYHW PINRPYPKLW
WIILLGMSGV ISLVSLSFLL NFPWDFSYLS YRFLLIVTLG SIAFTFLTTA VLIAQRDSQF
IVILVWGMYI SLLLWVNSPY WTGEIKANSS IINTIALNEF SGIS