Gene Cyan8802_3747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3747 
Symbol 
ID8393095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3858655 
End bp3860961 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content41% 
IMG OID644981676 
Producttransglutaminase domain protein 
Protein accessionYP_003139392 
Protein GI257061504 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.664376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.434261 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATA GCCCCTCAAA GCAATTATCT TTTTTTGAGA AATTAACCCA AACAATCGGT 
TCAACCCCCC TACCGCCTAC AGAAGAGTCC TTAGCGTTGC GAATTTTAGT CCAGGCCTTA
GTGATTGTCG GAATTATTGC TACCGATGTT GCTGCCGAAA CCCAGATGAG TTTGTGGGCA
ATTCCTCTGA GTATTGTCGG GGCAACCTGG AGTTGGTATC GCCGCAAATA TCGCAACATT
ACCATCAAGT TTTTCCTAGC CATAGGAATG CTAGTGGTTT TATTTGTTTT CTTAATTAAT
TTAGCGCAAA ATTCTAACGA TACTCGCCTA GCATTAGCAG GCTTATTAAT ACAATTACAG
GTACTCCATA GCTTTGATTT GCCTCGACGC AAAGACTTAG GCTACTCGAT GGTCATTGGT
TTAATTTTAC TCGGAGTTGC CGGAACCGTT TCTCAAACCC TGGTATTTGC CCTGTGGCTG
TTCCTTTTTT TAGCGATCGC CCTTCCTACT CTAGTCCTTG ATTATCGTTC CCGTTTAGGC
TTAAAAGCCG TTGATCAGCA GTTTTTCAAG CAATTTAGCT TATCGTCTCG ATCGAGTTCT
CCCCTCTCAT TACGTCGTCT GGTTTGGATT TTGGGGATTA CCGTGGGGTT AGGACTAACC
ATTTTTGCCA TTATGCCCCG GTTTCCAGGG TATCAGTTAC AAACCTTTCC CATGAGTAGT
CCAGTAGATG CCGACAGTCA ACGCTTTGAT AATCAAAACC GCAATATTTT GAATCCTGGT
TATGCCAGAA ATGGTCAAGA AAAGGGAGGT CAGGGGTTAG GAAAAGGGAC AAGTCCAACA
GAAGGAAAAG GGCAAGTAGA TCCCACTTAT TATTATGGAT TTAATGACAA GATCAATCAA
AATCTGCGGG GTCAAATGGA ACCCAAAATT GTTTTGCGGA TTCGTTCTCA AGCCCCCGGT
TTTTGGCAAG CCTTATCTTT TGATAAATAT ACCGGACAAG GGTGGGAAAT TTCGGGAGAT
AAAAAGTTAC AAATGTTGGA ACGAGATCCT TGGTCCCCTC GATTTTATTT ATCTCCTGCT
TCGACCTATA TGAACACCAA ACAGGTGATT CAAAGTTATA CCGCAGTCGC CGATTTACCC
AACGTCATTC CCTCCTTATT TGCGCCTAAA TATCTCTATT TTCCCGCTAA AGAAATTGGT
CAAGATCCCC AGAAAAATTT GCGATCGCCT GTTGGATTAC TCGAAGGATT GACCTATACT
GTTATCTCAG AAGTTCCTGA ACGCGATCGC ACCAAGTTAC GCAAAGTTGT TACAGACTAT
TCCGATAAAA TTAAACAAAC CTATCTCCAA ATCCCCCCAC AAATTGCCGA AAAAGTCCGA
GAAAAAACCT TACAATTATT AGCAAAATCC GAAAAACCCC TAGATTCTGT TTACGAAAAA
GCTATATTTT TAGCCCAAGA ATTGAAGCAA AAATATCGCC TTCAACCAGA TTTACCCTTC
TTTGATGACG GTGAAGATTT AGTAACAGGT TTTCTCTTTA AATATGAAGG AGGCTATCCC
GATCATTTTT CAACAACCTT AACCATTATG TTACGGTCAA TTGGCATTCC TGCTAGGTTA
ACTGTAGGAT TTGGTCAGGG TCAATTTAAT CCGTTTACGG GGTTTTATAT TGTTAAAAAT
ACCGATGCTT ATGCCTTAAC TGAGGTCTAT TTTCCCCATG TGGGTTGGTA TAGTTTTGAT
CCCATTCCAG GTCATGATTT AATTCCCCCT TCCTTTGAAG AAGATCAAAC TTTTAGTGTC
CTTAAACAAT TTTGGAATTG GATAGCGGGT TGGTTGCCTT CTCCTATTGC TAATTTCCTA
GCTATTCTGT GGACTAAAGT TCTGGGGGGT TTCTTTAGCT TTTTGTTATG GCTATGGCAC
TTTATTTCAG GTAGTTTAAT CGGCGTTTTT GTTGGTTTAT TATTAACGGT TATTTTAGGC
TTTTTAGGCT GGTTAGGGTG GGGACAATTA ACGACCTTAG CTTATCGTCG TCGCTTAGCT
AAATTACCCC CAATGGCACG ATTGTATGAG CAAATGTTAG GATTATTAAA AGAGAAAGGT
TATCCCAAGC ATCCCGCACA AACTCCTTTA GAATATGTCC AAGTTTCGCG TCAACATCAA
TCCTCAGAAC AAGCAGACAT TATTGAGGAA ATTTCCCAAG CGTATGTCAG TTGGCGATAT
GGAGAAAATC CCCAAAATTT GGACTATTTA AAACAGCAAT TTCAAGCATT AATGCGAAGT
TTAAAACGAG GTAAATCAAG TTCGTAA
 
Protein sequence
MENSPSKQLS FFEKLTQTIG STPLPPTEES LALRILVQAL VIVGIIATDV AAETQMSLWA 
IPLSIVGATW SWYRRKYRNI TIKFFLAIGM LVVLFVFLIN LAQNSNDTRL ALAGLLIQLQ
VLHSFDLPRR KDLGYSMVIG LILLGVAGTV SQTLVFALWL FLFLAIALPT LVLDYRSRLG
LKAVDQQFFK QFSLSSRSSS PLSLRRLVWI LGITVGLGLT IFAIMPRFPG YQLQTFPMSS
PVDADSQRFD NQNRNILNPG YARNGQEKGG QGLGKGTSPT EGKGQVDPTY YYGFNDKINQ
NLRGQMEPKI VLRIRSQAPG FWQALSFDKY TGQGWEISGD KKLQMLERDP WSPRFYLSPA
STYMNTKQVI QSYTAVADLP NVIPSLFAPK YLYFPAKEIG QDPQKNLRSP VGLLEGLTYT
VISEVPERDR TKLRKVVTDY SDKIKQTYLQ IPPQIAEKVR EKTLQLLAKS EKPLDSVYEK
AIFLAQELKQ KYRLQPDLPF FDDGEDLVTG FLFKYEGGYP DHFSTTLTIM LRSIGIPARL
TVGFGQGQFN PFTGFYIVKN TDAYALTEVY FPHVGWYSFD PIPGHDLIPP SFEEDQTFSV
LKQFWNWIAG WLPSPIANFL AILWTKVLGG FFSFLLWLWH FISGSLIGVF VGLLLTVILG
FLGWLGWGQL TTLAYRRRLA KLPPMARLYE QMLGLLKEKG YPKHPAQTPL EYVQVSRQHQ
SSEQADIIEE ISQAYVSWRY GENPQNLDYL KQQFQALMRS LKRGKSSS