Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3747 |
Symbol | |
ID | 8393095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 3858655 |
End bp | 3860961 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644981676 |
Product | transglutaminase domain protein |
Protein accession | YP_003139392 |
Protein GI | 257061504 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.664376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.434261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAATA GCCCCTCAAA GCAATTATCT TTTTTTGAGA AATTAACCCA AACAATCGGT TCAACCCCCC TACCGCCTAC AGAAGAGTCC TTAGCGTTGC GAATTTTAGT CCAGGCCTTA GTGATTGTCG GAATTATTGC TACCGATGTT GCTGCCGAAA CCCAGATGAG TTTGTGGGCA ATTCCTCTGA GTATTGTCGG GGCAACCTGG AGTTGGTATC GCCGCAAATA TCGCAACATT ACCATCAAGT TTTTCCTAGC CATAGGAATG CTAGTGGTTT TATTTGTTTT CTTAATTAAT TTAGCGCAAA ATTCTAACGA TACTCGCCTA GCATTAGCAG GCTTATTAAT ACAATTACAG GTACTCCATA GCTTTGATTT GCCTCGACGC AAAGACTTAG GCTACTCGAT GGTCATTGGT TTAATTTTAC TCGGAGTTGC CGGAACCGTT TCTCAAACCC TGGTATTTGC CCTGTGGCTG TTCCTTTTTT TAGCGATCGC CCTTCCTACT CTAGTCCTTG ATTATCGTTC CCGTTTAGGC TTAAAAGCCG TTGATCAGCA GTTTTTCAAG CAATTTAGCT TATCGTCTCG ATCGAGTTCT CCCCTCTCAT TACGTCGTCT GGTTTGGATT TTGGGGATTA CCGTGGGGTT AGGACTAACC ATTTTTGCCA TTATGCCCCG GTTTCCAGGG TATCAGTTAC AAACCTTTCC CATGAGTAGT CCAGTAGATG CCGACAGTCA ACGCTTTGAT AATCAAAACC GCAATATTTT GAATCCTGGT TATGCCAGAA ATGGTCAAGA AAAGGGAGGT CAGGGGTTAG GAAAAGGGAC AAGTCCAACA GAAGGAAAAG GGCAAGTAGA TCCCACTTAT TATTATGGAT TTAATGACAA GATCAATCAA AATCTGCGGG GTCAAATGGA ACCCAAAATT GTTTTGCGGA TTCGTTCTCA AGCCCCCGGT TTTTGGCAAG CCTTATCTTT TGATAAATAT ACCGGACAAG GGTGGGAAAT TTCGGGAGAT AAAAAGTTAC AAATGTTGGA ACGAGATCCT TGGTCCCCTC GATTTTATTT ATCTCCTGCT TCGACCTATA TGAACACCAA ACAGGTGATT CAAAGTTATA CCGCAGTCGC CGATTTACCC AACGTCATTC CCTCCTTATT TGCGCCTAAA TATCTCTATT TTCCCGCTAA AGAAATTGGT CAAGATCCCC AGAAAAATTT GCGATCGCCT GTTGGATTAC TCGAAGGATT GACCTATACT GTTATCTCAG AAGTTCCTGA ACGCGATCGC ACCAAGTTAC GCAAAGTTGT TACAGACTAT TCCGATAAAA TTAAACAAAC CTATCTCCAA ATCCCCCCAC AAATTGCCGA AAAAGTCCGA GAAAAAACCT TACAATTATT AGCAAAATCC GAAAAACCCC TAGATTCTGT TTACGAAAAA GCTATATTTT TAGCCCAAGA ATTGAAGCAA AAATATCGCC TTCAACCAGA TTTACCCTTC TTTGATGACG GTGAAGATTT AGTAACAGGT TTTCTCTTTA AATATGAAGG AGGCTATCCC GATCATTTTT CAACAACCTT AACCATTATG TTACGGTCAA TTGGCATTCC TGCTAGGTTA ACTGTAGGAT TTGGTCAGGG TCAATTTAAT CCGTTTACGG GGTTTTATAT TGTTAAAAAT ACCGATGCTT ATGCCTTAAC TGAGGTCTAT TTTCCCCATG TGGGTTGGTA TAGTTTTGAT CCCATTCCAG GTCATGATTT AATTCCCCCT TCCTTTGAAG AAGATCAAAC TTTTAGTGTC CTTAAACAAT TTTGGAATTG GATAGCGGGT TGGTTGCCTT CTCCTATTGC TAATTTCCTA GCTATTCTGT GGACTAAAGT TCTGGGGGGT TTCTTTAGCT TTTTGTTATG GCTATGGCAC TTTATTTCAG GTAGTTTAAT CGGCGTTTTT GTTGGTTTAT TATTAACGGT TATTTTAGGC TTTTTAGGCT GGTTAGGGTG GGGACAATTA ACGACCTTAG CTTATCGTCG TCGCTTAGCT AAATTACCCC CAATGGCACG ATTGTATGAG CAAATGTTAG GATTATTAAA AGAGAAAGGT TATCCCAAGC ATCCCGCACA AACTCCTTTA GAATATGTCC AAGTTTCGCG TCAACATCAA TCCTCAGAAC AAGCAGACAT TATTGAGGAA ATTTCCCAAG CGTATGTCAG TTGGCGATAT GGAGAAAATC CCCAAAATTT GGACTATTTA AAACAGCAAT TTCAAGCATT AATGCGAAGT TTAAAACGAG GTAAATCAAG TTCGTAA
|
Protein sequence | MENSPSKQLS FFEKLTQTIG STPLPPTEES LALRILVQAL VIVGIIATDV AAETQMSLWA IPLSIVGATW SWYRRKYRNI TIKFFLAIGM LVVLFVFLIN LAQNSNDTRL ALAGLLIQLQ VLHSFDLPRR KDLGYSMVIG LILLGVAGTV SQTLVFALWL FLFLAIALPT LVLDYRSRLG LKAVDQQFFK QFSLSSRSSS PLSLRRLVWI LGITVGLGLT IFAIMPRFPG YQLQTFPMSS PVDADSQRFD NQNRNILNPG YARNGQEKGG QGLGKGTSPT EGKGQVDPTY YYGFNDKINQ NLRGQMEPKI VLRIRSQAPG FWQALSFDKY TGQGWEISGD KKLQMLERDP WSPRFYLSPA STYMNTKQVI QSYTAVADLP NVIPSLFAPK YLYFPAKEIG QDPQKNLRSP VGLLEGLTYT VISEVPERDR TKLRKVVTDY SDKIKQTYLQ IPPQIAEKVR EKTLQLLAKS EKPLDSVYEK AIFLAQELKQ KYRLQPDLPF FDDGEDLVTG FLFKYEGGYP DHFSTTLTIM LRSIGIPARL TVGFGQGQFN PFTGFYIVKN TDAYALTEVY FPHVGWYSFD PIPGHDLIPP SFEEDQTFSV LKQFWNWIAG WLPSPIANFL AILWTKVLGG FFSFLLWLWH FISGSLIGVF VGLLLTVILG FLGWLGWGQL TTLAYRRRLA KLPPMARLYE QMLGLLKEKG YPKHPAQTPL EYVQVSRQHQ SSEQADIIEE ISQAYVSWRY GENPQNLDYL KQQFQALMRS LKRGKSSS
|
| |