Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3693 |
Symbol | |
ID | 7102940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 3880349 |
End bp | 3882655 |
Gene Length | 2307 bp |
Protein Length | 768 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643476705 |
Product | transglutaminase domain protein |
Protein accession | YP_002373808 |
Protein GI | 218248437 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATA GCCCCTCAAA GCAATTATCT TTTTTTGAGA AATTAACCCA AACAATCGGT TCAACGCCCC TACCGCCTAC AGAAGAGTCC TTAGCGTTGC GAATTTTAGT CCAGGCCTTA GTGATTGTCG GAATTATTGC TACCGATGTT GCTGCCGAAA CCCAGATGAG TTTGTGGGCA ATTCCTCTGA GTATTGTCGG GGCGACTTGG AGTTGGTATC GTCGCAACTA TCGCAACATT ACCATCAAGT TTTTCCTAGC CCTAGGAATG CTAGTGGTTT TATTTGTTTT CTTGATTAAT TTAGCGCAAA ATTCTAATGA TACTCGCCTA GCATTAGCGG GCTTATTAAT ACAATTACAG GTACTTCATA GCTTTGATTT ACCTCGACGC AAAGACTTAG GCTACTCGAT GGTCATTGGT TTAATTCTAC TCGGAGTCGC GGGAACCGTT TCTCAAACCC TGGTATTTGC CCTGTGGCTG TTCCTTTTCT TAGCGATCGC CCTTCCTACC CTAGTCCTTG ATTATCGTTC CCGTTTAGGC TTAAAAGCCG TTGATCAGCA GTTTTTCAAG CAATTTAGCT TATCGTCTCG ATCGAGTTCT CCCCTCTCAT TACGTCGTCT GGTTTGGATT TTGGGGATTA CCGTGGGGTT AGGACTAACC ATTTTTGCCA TTATGCCCCG GTTTCCAGGG TATCAGTTAC AAACCTTTCC CATGAGTAGT CCAGTAGATG CCGACAGTCA ACGCTTTGAT AATCAAAACC GCAATATTTT GAATCCTGGT TATGCCAGAA ATGGTCAAGA AAAGGGAGGT CAGGGGTTAG GAAAAGGGAC AAGTCCAACA GAAGGAAAAG GGCAAGTAGA TCCCACTTAT TATTATGGAT TTAATGACAA GATCAATCAA AATCTGCGGG GTCAAATGGA ACCCAAAATT GTCTTGCGGA TTCGTTCTCA AGCCCCGGGT TTTTGGCAAG CCTTATCCTT TGATAAATAT ACCGGACAAA GGTGGGAAAT TTCGGGAGAT AAAAAGTTAC AAATGTTAGA ACGAGATCCT TGGTCTCCTC GATTTTATTT ATCTCCTCCG TCAACCTATA TGAATACCAA ACGGGTGATT CAAAGTTATA CCGCAGTCGC CGATTTACCC AATGTTATTC CCTCCTTATT TGCACCTAAA TATCTTTATT TTCCCGCTAA AGAAATTGGT CAAGATCCCC AGAAAAATTT GCGATCGCCT ATCGGATTAC TCGAAGGATT GACCTATACT GTTATCTCAG AAGTTCCCGA ACGCGATCGA ACTAAGTTAC GCAAAGTTGT TACAGACTAT TCCGAGCAAA TCAAAAAACT TTATCTCCAA ATCCCCCCAC AAATTGCCGA AAAAGTCCGT CAAAAAACCT TGGAACTATT AGCAAAATCC GAAAAGCCGC TAGAGTCTGT TTACGAAAAA GCCCTATTTT TAGCCCAAGA ATTGAAACAA AAATATCGCC TTCAACCAGA TTTACCCTTC TTTGATGACG CTGAAGATTT AGTGACTTCT TTTCTCTTTA AACATGAAGG AGGTTATCCC GATCATTTTT CAACAACCCT AACCATTATG TTACGGTCAA TTGGCATTCC AGCGCGGTTA ACCGTAGGAT TTGGTCAGGG TCAATTTAAT CCGTTTACGG GGTTTTATAT TGTTAAGAAT ACCGATGCTT ATGCCTTAAC TGAGGTCTAT TTTCCCCATG TGGGTTGGTA TAGTTTTGAT CCCATTCCAG GTCATGATTT AATTCCCCCT TCCTTTGAAG AAGATCAAAC CTTTAGTGTC CTGAAACAAT TTTGGAATTG GATAGCGGGT TGGTTGCCTT CTCCTATTGC TAATTTCCTA GCTATTCTGT GGACTAAAGT TCTAGGTGGT TTCTTTAGCT TTTTGTTATG GCTATGGCAC TTTATTTCAG GGAGTTTAAT AGGCGTTTTT GTCGGCTTAG TATTAACGAT TATTGTAGGG TTTTTAGGGT GGTTAGGTTG GGGACAATTA GCAACATTAG GTTATCGTCG TCGCTTGGCT AAATTACCCC CAATGGCACG ATTATATGAG CAGATGTTAG GAGTATTAAA AGAGAAAGGT TATCCCAAAC ATCCCGCACA AACTCCACTA GAATATGTTC AAGTTTCGCG TCAACATCAA TCCTCAGAAC AAGCAGACAT TATTGAGGAA ATTTCCCAAG CGTATGTCAG TTGGCGATAT GGAGAAAATC CCCAAAATTT GGACTATTTA AAACAACAAT TTCAAGCATT AATGCGAAGT TTAAAACGAG GTAAATCAAG TTCGTAA
|
Protein sequence | MENSPSKQLS FFEKLTQTIG STPLPPTEES LALRILVQAL VIVGIIATDV AAETQMSLWA IPLSIVGATW SWYRRNYRNI TIKFFLALGM LVVLFVFLIN LAQNSNDTRL ALAGLLIQLQ VLHSFDLPRR KDLGYSMVIG LILLGVAGTV SQTLVFALWL FLFLAIALPT LVLDYRSRLG LKAVDQQFFK QFSLSSRSSS PLSLRRLVWI LGITVGLGLT IFAIMPRFPG YQLQTFPMSS PVDADSQRFD NQNRNILNPG YARNGQEKGG QGLGKGTSPT EGKGQVDPTY YYGFNDKINQ NLRGQMEPKI VLRIRSQAPG FWQALSFDKY TGQRWEISGD KKLQMLERDP WSPRFYLSPP STYMNTKRVI QSYTAVADLP NVIPSLFAPK YLYFPAKEIG QDPQKNLRSP IGLLEGLTYT VISEVPERDR TKLRKVVTDY SEQIKKLYLQ IPPQIAEKVR QKTLELLAKS EKPLESVYEK ALFLAQELKQ KYRLQPDLPF FDDAEDLVTS FLFKHEGGYP DHFSTTLTIM LRSIGIPARL TVGFGQGQFN PFTGFYIVKN TDAYALTEVY FPHVGWYSFD PIPGHDLIPP SFEEDQTFSV LKQFWNWIAG WLPSPIANFL AILWTKVLGG FFSFLLWLWH FISGSLIGVF VGLVLTIIVG FLGWLGWGQL ATLGYRRRLA KLPPMARLYE QMLGVLKEKG YPKHPAQTPL EYVQVSRQHQ SSEQADIIEE ISQAYVSWRY GENPQNLDYL KQQFQALMRS LKRGKSSS
|
| |