Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1411 |
Symbol | |
ID | 7103623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1477119 |
End bp | 1478987 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643474488 |
Product | glycosyl transferase family 39 |
Protein accession | YP_002371625 |
Protein GI | 218246254 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAAAA TAACCTTACA ACAAGAACCT ATTGAGCCTG AGATTCTTGC TAAAAAATCT CCAAAAACAC TTTGGATATT TAGTATTTTA TGGCTATTAT TAATTAGCTG GATTGCCTTT TTATGGAATT TGGGTGCAAT TGGACTGGTA GATAAAACTG AGCCAATGTT TGTCGAAGCA TCCCGTCAAA TGGCCTTAAC AGGAGATTGG ATAACACCCT ATTGGAATGG AGAAACTCGT TTTGATAAAC CTCCCTTGAC CTATTGGTTA ATATCCCTAT CTTTTAGGGT ATTTGGAGTT AATGAATGGG CTGCTCGTTT CCCCTCTGCT GTATTTGCGA TCGCCCTAGT GTGCCTAGGA TTTTATACCC TACGTTATTG GGGATTTTCC GATACCAACG TTCTTGGATC TTCAACGCAA GATAAAAACA ACTCAAATGA GCAGCAATTA TGGTTTTCAG CCATTATAGG ATCGGCAATT ATTGCCCTTA ATCCTTTTTG GATTGCTTGG GGAAGAACAG GGGTTTCTGA TATGTTTCTC TCTAGCAGTA TTGGGTTAGC CTTACTATCA TTTTTTATTG GCTATGTACA ACCTAAAACT GCATTACCTA AACCGTTAGG GTTATCAGTA AAAAATTGGT GGTATGTTGG CTATTGGGTA TTTATGGCCT TAGGGGTTTT GGCAAAAGGT CCCGTAGCAC TCGTCCTTCC TGGCTTTGTT GTTATCGCTT TTTTATTATA TGTAGGTCGG TTCCTAGAAG TGGTTAAAGA AACCCCTTGG TTATTAGGAA TTGGCAGCTT TATCTTAATT GCTGTTCCCT GGTTTATTTT AGTGACCCTA GAACACGGAC AAGAATATAT TGATACCTTC TTTGGATTTC ACAACGTTCA GCGATTTACT AGCGTTGTTA GCGTCCATCC TGGGGCATGG TATTATTACT TTCCAGTGGT TCTAGTCGCC TTAATTCCTT GGTCAATTTT TCTGCCCTTA GCCATTGCTC GTTTACGCTT TTGGAGACGT AACAATTGGG TTAATTCTGA ACGTTCAACC CATCTAGGAT TATTTGCCTT AGTTTGGTTT CTTGTTATCT TTGTTTTCTT TTCTAGTTCC GTTACAAAAT TAGCCGGGTA TGTTCTTCCT TTAGTCCCTG CTGCTGCCAT TCTGATTGCC CTGTTTTGGA GTGATCAAAC GACGCAAAAA GCAACCCAAA ACCAAGGAAT TTGGCCGCTA TTATTTATGC TGAGTGGATT AGCTAATATA GGAATTTTAG GCGGTTTAGC GGTAGCGAGT TTTCTCAGTC CTCAATTGGT GGGTAATGAC CCAATGATGC CCCAATTTAA CCAACTATTA CAAGCATCCG GTTTACCCAT CAGAAGCGGA ATTATTTGGG CTTTTTCTAC CTTGGGTGTT TTAATTTTAT TGCTTTGGAA AGGTCATAAA CGGTGGATTT GGGCAGCAAA TCTATTAGGA TTTATGGCAT TTTTTAGTTG GGTTGTACTT CCTATTGCCC CCATTATTGA CAGCGAACGT CAACTTCCTT TAAGAGAATT AGCTACCTTA GTTAAGAAAG AACAAAAGCC TGGAGAAAAG TTAGTTTTAC TCGGTTTTAT GCGTCCAAGT ATGGTTTATT ACACACAACA AACCGTTGAT TCTATTACCG AAACTGACAT CTACAGTGGA CAAGCAATTG AATACTTTAA ACAGCAACTC AATACCCCAG GAACTTCTCC AACAACCTTA ATTGTTGCTA AACCAAGATA CTTTGAAAAA CTGCGTTTAA ACCCTTCTGA TTATGAAGTG ATTGGTGACG GAAAAGTCTA TCAATTAATT CGCGTCTCTA AAGATAAGGT AATCCAGAAG AACTCATAA
|
Protein sequence | MHKITLQQEP IEPEILAKKS PKTLWIFSIL WLLLISWIAF LWNLGAIGLV DKTEPMFVEA SRQMALTGDW ITPYWNGETR FDKPPLTYWL ISLSFRVFGV NEWAARFPSA VFAIALVCLG FYTLRYWGFS DTNVLGSSTQ DKNNSNEQQL WFSAIIGSAI IALNPFWIAW GRTGVSDMFL SSSIGLALLS FFIGYVQPKT ALPKPLGLSV KNWWYVGYWV FMALGVLAKG PVALVLPGFV VIAFLLYVGR FLEVVKETPW LLGIGSFILI AVPWFILVTL EHGQEYIDTF FGFHNVQRFT SVVSVHPGAW YYYFPVVLVA LIPWSIFLPL AIARLRFWRR NNWVNSERST HLGLFALVWF LVIFVFFSSS VTKLAGYVLP LVPAAAILIA LFWSDQTTQK ATQNQGIWPL LFMLSGLANI GILGGLAVAS FLSPQLVGND PMMPQFNQLL QASGLPIRSG IIWAFSTLGV LILLLWKGHK RWIWAANLLG FMAFFSWVVL PIAPIIDSER QLPLRELATL VKKEQKPGEK LVLLGFMRPS MVYYTQQTVD SITETDIYSG QAIEYFKQQL NTPGTSPTTL IVAKPRYFEK LRLNPSDYEV IGDGKVYQLI RVSKDKVIQK NS
|
| |