Gene Cyan8802_2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2949 
Symbol 
ID8392277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2983220 
End bp2985541 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content43% 
IMG OID644980898 
Productglycosyl transferase family 2 
Protein accessionYP_003138632 
Protein GI257060744 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.496141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAA AGTCAAATTT GGGTTCAAGC ACCCTTACTA AACCGAGAAA ACGGCGATCG 
CTGAAAGCAC GCCTATGGCG ATCGCCTTTT TTGCCCCTTC ATCTAGCTAC CCTAGTGATG
TTAGTGGGTG TTGGCCTCTT TATTGCCTTG TCTTTGAGTT GGTTACTGGG CAATCCTACC
ATCACTAATT TAGCCCTGAG TATCCATCAG CAGCAACTCG ATCCCCCCTG GTTTGTCCGT
GTTCCTGAAA CTCCCTATCG ACAGTTTTTA ATCGGCATTT TTGTGGCATT AGTGGGGATA
ATCTTGGCGA TTACTCAGAC CGTCCCGAAA CCGACGCGAT GGACTAAAGC GATTATTGCT
GGTATTTTAA TCGCTTTGGC GGTTCGTTAC CTCTTTTGGC GGATTTTGGC TACGTTGAAT
TTAAGTAATG CACTGACAGG CATTTTAGCC CTCGCCTTAT TATTCTTAGA AATTCCCTTT
GTTTTAGCGG GTTTACCCCA ACTTTTTTGG GTATCGAACA CGAGGAATTA CAGTAAGCAA
GCGGATGGTT ACGAAATTGC CGTTAAACAG GGAAAATATC GCCCTCCAGT GGATATTTTT
ATCCCGACTT ATAATGAACC CGAATCGATT GTGCGACGAA CGATTATCGG GTGTCAAGCT
ATTGATTATG AACCAAAAAC GATTTATCTT TTGGATGATG GACAGCGATC GCCCATTGAA
TCCCTGGCTC AAGAATTGGG CTGTAACTAT ATTACCCGTA GCGATCGCCG TCACTATAAA
GCCGGAAATC TAAACAACGC CCTCCAATAC ACCCAAGGGG ACTTAATTGC CGTATTTGAC
GCGGATTTTG TCCCCACTCG CAACTTTTTG CTGCGAACGG TGGGCTTTTT TCAACAACCT
GATATCGGGA TTGTTCAATC TCACCAAAAT TACTACAATC CTGACGCAAT TGTTCGGAAT
TTGGGGTTAG CTCAGTATTT AACCAGTAAT CGAGAAGGTT TCTCTCGCTA CGTTCAACCT
ACCCTCGATA GTGTGGGAGC AACAGTTTGT GATGGGTCTG CTTTTGTCGT TCGTCGTCGA
GATCTCGATA AAATTGGTGG TTTTGTGACA GAATCTTTGT GTGAAGATTA TTTTACGGGA
ATTTTGTTAG ATTCCCACCA TCAGAAGGTG ATTTACCTCG ATGAAAATCT CAGTGCAGGT
CTGGCGGCCG AGAGTTTGAA TGATTACGTC GGCCAGTATC AACGCTGGTT GATGGGCAGT
TTACAGGCGT TTTTTATTAA AACAAACCCT CTGACCCTTT CTGGGTTAAC GCTACGGCAA
CGGGTGGCTC ATCTGATGAG TTTAGTTTAC TGGTTGACGG GTTTTCCCCG TTTGCTGATT
TTGTTAGTTC CGATTATCTG CGGTTTAGCG TCGATTTTTC CGATTATTAT TACCCCTGAT
GATTGGCTGT ATTTCTTATT TTTGCCCCAT TTATTGCTAT TATTCTCGAT GCACTGGTTA
AGCGATCGCT CTACATCGAT GTTACTATCG GAAATTTACA CGATTATCCA TGCAATTCCT
TTTAGTTTAA CGGCGGTTCA AGTCTTTTTG CGACCCTTTT CTCGCGCGTT TCAGGTCACT
CCGAAGGGGT TTTTATCGGA CGGGTTTCGG GTCAATGCTT GGTTAACGAT TCCTTTGGGG
TTACTATGGC TAGGGAATGG AGGAATGCTG GTTAATTTTC TCTGGCAACG CATCTATAAC
CCCCAGGGTC TTCCGGCCGG GTTTGCTGAT ATTTGGGGAG GAACGGGCGG AATTTTGGTC
TTTTGGTGGG TTTATAATCT GATTTTCTTA GGGTTAGCTA TTTTAGCTTG TATTGATCCT
CCTAAACCAG AAACTTGTGA GTGGTTTAAG TTAGAACGAC CAATGGTTTT AAGTTGGGAA
AATAGTTTAA TCAAGGGAAT AACGCATCTG GTTTCTGAAA AAGGTGCGCG TGTTCGAGTG
AACACAAAAT CTCAGGAGAA ACTCAATATT TCCTGCGGAG ATGTTATCAG TATGGAAATT
AAATTAAATG AATGGCCAGG AAATCTTAGA GTAGAGGGAC GGGTTACGAA AATTATTAAC
ACTAAGGGTC ATTGTATTAA AGAAATTGAT ATTAAGTTTG AAGGGATGAC TTCTCAACAG
TATCGTCATT TAGTAGAACT GCTTTTTTGT CGGCCTGGTC AATGGGTGAG ACGAGAACAT
CCTAATGAAC TCATCACGCT TATCGCTTTA GGTAAACGAC TGTTACGACC TCGCTTTTTA
TTAAACAATG ATGAAGCTAT TGATGCTATT TCTATTCATT AA
 
Protein sequence
MPKKSNLGSS TLTKPRKRRS LKARLWRSPF LPLHLATLVM LVGVGLFIAL SLSWLLGNPT 
ITNLALSIHQ QQLDPPWFVR VPETPYRQFL IGIFVALVGI ILAITQTVPK PTRWTKAIIA
GILIALAVRY LFWRILATLN LSNALTGILA LALLFLEIPF VLAGLPQLFW VSNTRNYSKQ
ADGYEIAVKQ GKYRPPVDIF IPTYNEPESI VRRTIIGCQA IDYEPKTIYL LDDGQRSPIE
SLAQELGCNY ITRSDRRHYK AGNLNNALQY TQGDLIAVFD ADFVPTRNFL LRTVGFFQQP
DIGIVQSHQN YYNPDAIVRN LGLAQYLTSN REGFSRYVQP TLDSVGATVC DGSAFVVRRR
DLDKIGGFVT ESLCEDYFTG ILLDSHHQKV IYLDENLSAG LAAESLNDYV GQYQRWLMGS
LQAFFIKTNP LTLSGLTLRQ RVAHLMSLVY WLTGFPRLLI LLVPIICGLA SIFPIIITPD
DWLYFLFLPH LLLLFSMHWL SDRSTSMLLS EIYTIIHAIP FSLTAVQVFL RPFSRAFQVT
PKGFLSDGFR VNAWLTIPLG LLWLGNGGML VNFLWQRIYN PQGLPAGFAD IWGGTGGILV
FWWVYNLIFL GLAILACIDP PKPETCEWFK LERPMVLSWE NSLIKGITHL VSEKGARVRV
NTKSQEKLNI SCGDVISMEI KLNEWPGNLR VEGRVTKIIN TKGHCIKEID IKFEGMTSQQ
YRHLVELLFC RPGQWVRREH PNELITLIAL GKRLLRPRFL LNNDEAIDAI SIH