Gene PCC8801_3693 details


Gene Information

Locus tag: PCC8801_3693
Symbol:
ID: 7102940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3880349
End bp: 3882655
Gene Length: 2307 bp
Protein Length: 768 aa
Translation table: 11
GC content: 41%
IMG OID: 643476705
Product: transglutaminase domain protein
Protein accession: YP_002373808
Protein GI: 218248437
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
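
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state its convention, but the listed gene length is consistent with it) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_length // 3 - 1


length = gene_length_bp(3880349, 3882655)
print(length)                     # 2307
print(protein_length_aa(length))  # 768
```

Both values agree with the Gene Length and Protein Length fields listed for PCC8801_3693.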


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA GCCCCTCAAA GCAATTATCT TTTTTTGAGA AATTAACCCA AACAATCGGT 
TCAACGCCCC TACCGCCTAC AGAAGAGTCC TTAGCGTTGC GAATTTTAGT CCAGGCCTTA
GTGATTGTCG GAATTATTGC TACCGATGTT GCTGCCGAAA CCCAGATGAG TTTGTGGGCA
ATTCCTCTGA GTATTGTCGG GGCGACTTGG AGTTGGTATC GTCGCAACTA TCGCAACATT
ACCATCAAGT TTTTCCTAGC CCTAGGAATG CTAGTGGTTT TATTTGTTTT CTTGATTAAT
TTAGCGCAAA ATTCTAATGA TACTCGCCTA GCATTAGCGG GCTTATTAAT ACAATTACAG
GTACTTCATA GCTTTGATTT ACCTCGACGC AAAGACTTAG GCTACTCGAT GGTCATTGGT
TTAATTCTAC TCGGAGTCGC GGGAACCGTT TCTCAAACCC TGGTATTTGC CCTGTGGCTG
TTCCTTTTCT TAGCGATCGC CCTTCCTACC CTAGTCCTTG ATTATCGTTC CCGTTTAGGC
TTAAAAGCCG TTGATCAGCA GTTTTTCAAG CAATTTAGCT TATCGTCTCG ATCGAGTTCT
CCCCTCTCAT TACGTCGTCT GGTTTGGATT TTGGGGATTA CCGTGGGGTT AGGACTAACC
ATTTTTGCCA TTATGCCCCG GTTTCCAGGG TATCAGTTAC AAACCTTTCC CATGAGTAGT
CCAGTAGATG CCGACAGTCA ACGCTTTGAT AATCAAAACC GCAATATTTT GAATCCTGGT
TATGCCAGAA ATGGTCAAGA AAAGGGAGGT CAGGGGTTAG GAAAAGGGAC AAGTCCAACA
GAAGGAAAAG GGCAAGTAGA TCCCACTTAT TATTATGGAT TTAATGACAA GATCAATCAA
AATCTGCGGG GTCAAATGGA ACCCAAAATT GTCTTGCGGA TTCGTTCTCA AGCCCCGGGT
TTTTGGCAAG CCTTATCCTT TGATAAATAT ACCGGACAAA GGTGGGAAAT TTCGGGAGAT
AAAAAGTTAC AAATGTTAGA ACGAGATCCT TGGTCTCCTC GATTTTATTT ATCTCCTCCG
TCAACCTATA TGAATACCAA ACGGGTGATT CAAAGTTATA CCGCAGTCGC CGATTTACCC
AATGTTATTC CCTCCTTATT TGCACCTAAA TATCTTTATT TTCCCGCTAA AGAAATTGGT
CAAGATCCCC AGAAAAATTT GCGATCGCCT ATCGGATTAC TCGAAGGATT GACCTATACT
GTTATCTCAG AAGTTCCCGA ACGCGATCGA ACTAAGTTAC GCAAAGTTGT TACAGACTAT
TCCGAGCAAA TCAAAAAACT TTATCTCCAA ATCCCCCCAC AAATTGCCGA AAAAGTCCGT
CAAAAAACCT TGGAACTATT AGCAAAATCC GAAAAGCCGC TAGAGTCTGT TTACGAAAAA
GCCCTATTTT TAGCCCAAGA ATTGAAACAA AAATATCGCC TTCAACCAGA TTTACCCTTC
TTTGATGACG CTGAAGATTT AGTGACTTCT TTTCTCTTTA AACATGAAGG AGGTTATCCC
GATCATTTTT CAACAACCCT AACCATTATG TTACGGTCAA TTGGCATTCC AGCGCGGTTA
ACCGTAGGAT TTGGTCAGGG TCAATTTAAT CCGTTTACGG GGTTTTATAT TGTTAAGAAT
ACCGATGCTT ATGCCTTAAC TGAGGTCTAT TTTCCCCATG TGGGTTGGTA TAGTTTTGAT
CCCATTCCAG GTCATGATTT AATTCCCCCT TCCTTTGAAG AAGATCAAAC CTTTAGTGTC
CTGAAACAAT TTTGGAATTG GATAGCGGGT TGGTTGCCTT CTCCTATTGC TAATTTCCTA
GCTATTCTGT GGACTAAAGT TCTAGGTGGT TTCTTTAGCT TTTTGTTATG GCTATGGCAC
TTTATTTCAG GGAGTTTAAT AGGCGTTTTT GTCGGCTTAG TATTAACGAT TATTGTAGGG
TTTTTAGGGT GGTTAGGTTG GGGACAATTA GCAACATTAG GTTATCGTCG TCGCTTGGCT
AAATTACCCC CAATGGCACG ATTATATGAG CAGATGTTAG GAGTATTAAA AGAGAAAGGT
TATCCCAAAC ATCCCGCACA AACTCCACTA GAATATGTTC AAGTTTCGCG TCAACATCAA
TCCTCAGAAC AAGCAGACAT TATTGAGGAA ATTTCCCAAG CGTATGTCAG TTGGCGATAT
GGAGAAAATC CCCAAAATTT GGACTATTTA AAACAACAAT TTCAAGCATT AATGCGAAGT
TTAAAACGAG GTAAATCAAG TTCGTAA
 
Protein sequence
MENSPSKQLS FFEKLTQTIG STPLPPTEES LALRILVQAL VIVGIIATDV AAETQMSLWA 
IPLSIVGATW SWYRRNYRNI TIKFFLALGM LVVLFVFLIN LAQNSNDTRL ALAGLLIQLQ
VLHSFDLPRR KDLGYSMVIG LILLGVAGTV SQTLVFALWL FLFLAIALPT LVLDYRSRLG
LKAVDQQFFK QFSLSSRSSS PLSLRRLVWI LGITVGLGLT IFAIMPRFPG YQLQTFPMSS
PVDADSQRFD NQNRNILNPG YARNGQEKGG QGLGKGTSPT EGKGQVDPTY YYGFNDKINQ
NLRGQMEPKI VLRIRSQAPG FWQALSFDKY TGQRWEISGD KKLQMLERDP WSPRFYLSPP
STYMNTKRVI QSYTAVADLP NVIPSLFAPK YLYFPAKEIG QDPQKNLRSP IGLLEGLTYT
VISEVPERDR TKLRKVVTDY SEQIKKLYLQ IPPQIAEKVR QKTLELLAKS EKPLESVYEK
ALFLAQELKQ KYRLQPDLPF FDDAEDLVTS FLFKHEGGYP DHFSTTLTIM LRSIGIPARL
TVGFGQGQFN PFTGFYIVKN TDAYALTEVY FPHVGWYSFD PIPGHDLIPP SFEEDQTFSV
LKQFWNWIAG WLPSPIANFL AILWTKVLGG FFSFLLWLWH FISGSLIGVF VGLVLTIIVG
FLGWLGWGQL ATLGYRRRLA KLPPMARLYE QMLGVLKEKG YPKHPAQTPL EYVQVSRQHQ
SSEQADIIEE ISQAYVSWRY GENPQNLDYL KQQFQALMRS LKRGKSSS
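
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The following is a minimal sketch in plain Python rather than a reference implementation: it builds the codon table from the standard compact amino-acid string (table 11 uses the same codon assignments as the standard code and differs only in the set of permitted start codons), strips the grouping spaces used in this record, and stops at the first stop codon. It does not handle alternative start codons such as GTG or TTG, which is acceptable here only because this gene begins with ATG. The names `CODON_TABLE`, `clean` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGAAAATA GCCCCTCAAA GCAATTATCT"))  # MENSPSKQLS
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would confirm whether the two listings agree over their whole length.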