Gene PCC8801_3962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3962 
Symbol 
ID7103457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4149145 
End bp4150797 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content36% 
IMG OID643476958 
Productputative glycosyl transferase 
Protein accessionYP_002374059 
Protein GI218248688 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACGTGGC AACAATCGGT CTTTCAGATT TTGCTAGTAG GATTGGTTTT TCGTTCCATT 
CTTGCCTTAT GGTTGTTTCC TGGCTACGAT GAAGGCTATT ATTATCTCTA TAGTCAACAT
TTAGATTGGA GTTACTTTGA TCATCCACCT TTAGTTGCTT TAACAACGGG GTTTGGTGTT
TGGTTAACGG GAATTGCTTC ACCATTTACA CTCCGTTTAG GGACACTAAT TTTATATACA
GGCAGTCTTT TACTATTATA TCTCACCAGT GCTCGATTAT TTAATCATCA AGGGGCAAAA
ATAACATTAA TTATTGCTTC TATTATTCCT ATCTTTACCG TAGGTTTTGG GGTAATTACG
CTTCCTGATA GTCCTTTAAT CTTTTTTTGG TCAGCAAGTT TATATTGCGC GGTTTGTGAA
TTTTTTCCAA CAGCTAATCA AAGTTATCGT CCTAGCTATC GCCTTAGTAT TTTAGGAATT
TTGGTTGGGT TAGCTTGTTT GAGTAAATAT CATGGCTTTA TTCTAGCATT AGGATTAATT
GCCTTTTGTC TAACCAGTTC TAAGCATCGC TGTGCTTTAT TCTCTCCGTG GATGGGACTA
GCAATAGGAT TATTTATCAT CACGCTATTT CCTCTTTGGT TTTGGAATTT ACAGCATAAT
TGGGTTTCTT TACGATTTCA GTTATTTCAT CGCTTTGATC CTCCACCAGA GGGAATTGTT
TCACCTTCTG GCTATAATTT ATTGAAAGTT TTTGGGGTTT TTTTGACAGG TATAGGCTTA
CTTTTTCCGA CGATTGGTTT TCCTTTATGG TGGGTTAATC TACGAACGCT AATCAATCAA
TTTTCTGATA TTTTTCCGCA CAAATGGCTT CAATATTCAT GGGTTTTTCC TCTAAGGGAA
AGGGATGATT TTTTAAAAGA AAAACAGTTA TTAATTCTCT GGGTTTCTTT GCCGTTAATG
GTAGGATTTA CTCTGTTAGG AGGAAAACAA CAAATTCTTG TTACTTGGCC TATCCCTGGA
TTTTGGGGAG CCACTTTATT ATTAGGAATT TATGCGTTTC AATGGCAACA GAGATCGCGT
TATCTTATTA GATGGTGGTT AGGAGGAACA GGGATTATTA TTTCGACTAT TTTATTTCTA
TTGCTGCTAC ATATTACTAC AGGAACCCTA CAAAAATCCA GTCATTCTGC TATCTTTGGC
GGATTTTTGG AGCCTAAAAG TGATCCCGCT AATGAATTGA TTGATATTCA ACAACTCCGT
CAAGGGTTTG CTTCATCTCC TGTGCTTTTA AATGCTTTAC AAAACAGTCA TTTTATCTTT
AGTAATGCCT ATTACTTAGG AGGATTAATA GATTTAGCCC TCAGACCTCT AAACCCTATC
CCCGTGACTT GTTTTAGTTA CGATAGACGG GGATTTTCAT TTTGGCCGGA TACTGATCAA
TGGATAGGAG AAGATGCTCT TTATATTACC CTAGAAAGAT TCCATAAAAT GCCTCATTTA
AACAATGAAT TTAGGGATTA CTTCTTGAGT TTTCAAGAAA TTGGAACAGT TCCTATCCAA
CGAGGAGGGG TCGTTGTTAC CATTTTTCAT GTGTATCAAG CCAAAACATT ATTAAGACCC
TATAGTAGTC CAATCAATAA TCAATTGAAC TAG
 
Protein sequence
MTWQQSVFQI LLVGLVFRSI LALWLFPGYD EGYYYLYSQH LDWSYFDHPP LVALTTGFGV 
WLTGIASPFT LRLGTLILYT GSLLLLYLTS ARLFNHQGAK ITLIIASIIP IFTVGFGVIT
LPDSPLIFFW SASLYCAVCE FFPTANQSYR PSYRLSILGI LVGLACLSKY HGFILALGLI
AFCLTSSKHR CALFSPWMGL AIGLFIITLF PLWFWNLQHN WVSLRFQLFH RFDPPPEGIV
SPSGYNLLKV FGVFLTGIGL LFPTIGFPLW WVNLRTLINQ FSDIFPHKWL QYSWVFPLRE
RDDFLKEKQL LILWVSLPLM VGFTLLGGKQ QILVTWPIPG FWGATLLLGI YAFQWQQRSR
YLIRWWLGGT GIIISTILFL LLLHITTGTL QKSSHSAIFG GFLEPKSDPA NELIDIQQLR
QGFASSPVLL NALQNSHFIF SNAYYLGGLI DLALRPLNPI PVTCFSYDRR GFSFWPDTDQ
WIGEDALYIT LERFHKMPHL NNEFRDYFLS FQEIGTVPIQ RGGVVVTIFH VYQAKTLLRP
YSSPINNQLN