Gene PCC8801_1088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1088 
Symbol 
ID7102256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1143489 
End bp1145180 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content45% 
IMG OID643474180 
Producttransglutaminase domain protein 
Protein accessionYP_002371318 
Protein GI218245947 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCCTA ATTCCGACCT ATTTGCGTCT TCACCAAAAT ACTCTAGCGT CTCTACAACA 
ATACGCCCCA TCGCTGCGGC AGAAATTCAT GGCATCACCT TTCGAGGAGA ATTATTATTA
GCGATCGATA GTCGCAATGG CTATCTGTTG CAAATTGATC CTATCACCCA TAATACGGAA
ATTCTCAACA CCGAACACTG GGAAAACTTT ATCGGAACAA CAGGAATAGC GATCACAGGA
GATACCCTTT GGTTTACGTC AGGACGAAGT ATTTATCGCT GTTCCCTCAG TAGCGGAGAC
TTTGCGGCTG AAGTCTTCAC TCGTTTAGAC TATGCAGCCG TTGGGTTAGC GGTTTGGGAC
TCGACTATTT ACGTCTCTTG CCAAAAAACC GGGGATATTT TGGTCTTAAA TGGGGAAACT
GGCGAACAAA TTACCCGTTT ATACGCCCCC GGTATTGGCA ACGAAAACCT GACCATTCGC
GGGGAAGAAT TATGGGTGAC AGATAGTTTA GAACAGTCAG TGTACTGTCT CGATCGCGCC
ACGGGTCAAA TTATCTTTAG TGTCTTAACC CCCTTTGAGT CCCCCACAGG GTTAACGTTT
CTGCGCAACT CCCAAACAGG GGAAGATACC TTATATGTAG CCTACGTTAA TTATGAACCG
TATATTCGGG ATAATCCTAA CGCTGATCCC AACCATGAAC TCCTCTATCG GAGTCGAACC
TTTATCCATC CCTTATACTT CCATTATGAT CCAGAGAATC ATTATGCCTT GTCTAATGGG
TATTTGGTGG AAATGTCCTA TGTGGAAGAA CTCGAACCCC TTGATCAGAT CGAATTAACG
AATCTAGAGT GGCGTATTGC CCTCCCGGCG GAAACCCACC GTCAAAAGGT GAGAAAAGTT
GAACCCATTG GACTACCCTT TACAGAAGAG GTGGAAAATG GGCAAAAAGT AGCGGTATTT
AAGTTTGATA AGCTAACATC CCAAAATCGT TGTGTTTTTG GTTGGAAAGC GTTATTAGAG
GTTTCGAGTA TTAAATATCG CCTAACGCCG CGAGACTGCG AGAATATCCC TGATCTGCCG
CCAGAATACA GCGATCGCTA TCTGATTGAT AACGATAATT TAGCCATGGA TACCGATATT
ATCAAACGCG CCGCAGTAGA AGCAGCCGAA CGGGAAACGA ATTTGCTCAG AAAGGTCTAT
AGTATTCGTA ACTACGTCTA TGATCGCCTT TCCTACGGCA TTAAACCCCA TATTGATACC
CCTGATATTG CCTTACGACG AGGGGTCGGT TCCTGTGGGG AATATGTGGG GGTTTTGTTA
GCGTTATTTC GTCTGAATGG GATAGCTTGT CGTACGGTTG GACGGTATAA GTGTCCTCCC
TCTCCGTTAA CGCGAAATCA GCCCCTAGAA CCGGATTATA ATCATGTTTG GTTGGAGTTT
TATATTCCGA GTATCGGCTG GTTGCCCATG GAGTCTAACC CCGATGATAT TATTGATGGT
GGCCCCTATC CGACGCGGTT TTTTATGGGG TTAGCTTGGT ATCACGCAGA AATGGCTAAA
GATACGCCTT TTGAACGGTT GATTAGTAAT GGTATTCCGT TGAATAAACA GCAAGTTTCT
ATCGGATCTT TGGCCATTAA TCATGTTCGG TTTACCATCC TTGAAGAGTT AGATCCGGCT
AAACGTTCTT GA
 
Protein sequence
MIPNSDLFAS SPKYSSVSTT IRPIAAAEIH GITFRGELLL AIDSRNGYLL QIDPITHNTE 
ILNTEHWENF IGTTGIAITG DTLWFTSGRS IYRCSLSSGD FAAEVFTRLD YAAVGLAVWD
STIYVSCQKT GDILVLNGET GEQITRLYAP GIGNENLTIR GEELWVTDSL EQSVYCLDRA
TGQIIFSVLT PFESPTGLTF LRNSQTGEDT LYVAYVNYEP YIRDNPNADP NHELLYRSRT
FIHPLYFHYD PENHYALSNG YLVEMSYVEE LEPLDQIELT NLEWRIALPA ETHRQKVRKV
EPIGLPFTEE VENGQKVAVF KFDKLTSQNR CVFGWKALLE VSSIKYRLTP RDCENIPDLP
PEYSDRYLID NDNLAMDTDI IKRAAVEAAE RETNLLRKVY SIRNYVYDRL SYGIKPHIDT
PDIALRRGVG SCGEYVGVLL ALFRLNGIAC RTVGRYKCPP SPLTRNQPLE PDYNHVWLEF
YIPSIGWLPM ESNPDDIIDG GPYPTRFFMG LAWYHAEMAK DTPFERLISN GIPLNKQQVS
IGSLAINHVR FTILEELDPA KRS