Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1929 |
Symbol | |
ID | 8391244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 1951580 |
End bp | 1953163 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644979909 |
Product | ThiJ/PfpI domain protein |
Protein accession | YP_003137655 |
Protein GI | 257059767 |
COG category | [R] General function prediction only |
COG ID | [COG0693] Putative intracellular protease/amidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGTAA GTAAAATTAT GTCTTATCTA CCATTACAAG GCAAAAAAAT CGCTATTCTA GTCAATTCAC AGTATATCGC TCAAGAAATT AAAGGATACC AAGAAAAATT TACCGCTTAT GGGGCAAAAG TTGACTTGAT GTCTCGACTG TGGGGACAAA CTGAGCAAAC CTTCGTCAGT GAAGTGGAAC AAGAAGGAAA AACCCCCGAA ACCCTGACAG TTTGGATCGA TTTTACCCAA GTTAATCTCA ATGACTACGC CGCCGTCATT ATGGCGGCGA ATTATCCCAG TGTGCGGTTA CGTTGGCTAA GCGATCAAGA TGCCTCCGGA CAACCTATCA ACAACAGTAG TGGTCGTCTT TCCCCTGCGG TACAATTCAT CTATCAAGCC ATGATGAACC CTAAAATCAT CAAAGGCTTT CCTTGTCATG CGTTATGGCT TCTAACCCCT ATTCCTGAAG TCTTAGCGGG TCGCAAAGTC ACTTGTAACC GCGTGATGCT AGGGGATGTT AGTAACGCTG GAGCAATTAT TAGTGAAACA GCCAGTGGGG TTGTCGTAGA TAGCGATATC GTGACCAGTG ACAGCGATAG TCACCGAGAA GCGTTTATTG AGGCGATTTG TCAACAAATT CAAGCCGTAG ACCAAGGAAC CCTACAACCC GCTATCACGG CTGCTACGAC TCCTTCTGCT AACGTCTCGG TTGAGTCCGT TATTCCCTAT CTACGAGAAC GCAAAATTTT GATCCTTCTC TCAGAATGGG GTTACTGGGG AGAAGAATTA GTCGGTCCGT TAGAAACATT TGACAAAGTG GGGTATCAAG TATCTTTCTG TACCCCCACT GGCCGAAGAC CGAACGCGAT CGCGGTTTCC ATGGACCCCC TTTATATCGA TCCTCCTCTG GGTCGTTCTG TCACCTGCGT AGCGATGGCC AAAAAAGTCG CTGAAATTGA TGATCCGAGT ACCAATCAGG GGAAACGACT CGATACCCCG ATCAATTTGA GGCAATGGTT TCCCGAACGT CCCTATTGGT CTGATTCCCA ATTAGTACGG TTAATGGAGA TTTACTACGA ACGCCTCAGA CGAGCCCAAG AAAGCCTTGA TGAGTTCGAT GCCTTATTAA TTGTCGGGGG TAGTGGTCCT ATCGTCGATT TAGCCAATAA TCAACGGGTT CACGACTTAA TTCTCGGTTT CTATGGACAA GGCAAACCCG TCGCGGCCGA ATGCTATGGG GTCACTTGTT TGGCTTTTGC TCGCAATATC GAGAACAAAC AATCGATTAT TTGGGGTAAG CAGGTCACAG GACATTGTAT CGAATACGAT TACAAGGATG GAACTGGGTT TATGCGATCG CGTGGTCAAT TCCTCGATTT CAACATGGGA CCCCCACCCT ATCCCCTAGA ATACATTCTA CGGGATGCTA CAGGACCTGA CGGAGCTTAT ATCGGTAATT TTGGTCATCC CACCAGTGTT ATTGTGGATT ATCCCTTTAT TACGGGACGG TCTACCCCGG ATTCCTATTT AACGGGACAA AAACTCGTTG AAGTTCTCGA TGGGGAACCC CCTCTGCGTC GTTGGGGTTG GTAG
|
Protein sequence | MGVSKIMSYL PLQGKKIAIL VNSQYIAQEI KGYQEKFTAY GAKVDLMSRL WGQTEQTFVS EVEQEGKTPE TLTVWIDFTQ VNLNDYAAVI MAANYPSVRL RWLSDQDASG QPINNSSGRL SPAVQFIYQA MMNPKIIKGF PCHALWLLTP IPEVLAGRKV TCNRVMLGDV SNAGAIISET ASGVVVDSDI VTSDSDSHRE AFIEAICQQI QAVDQGTLQP AITAATTPSA NVSVESVIPY LRERKILILL SEWGYWGEEL VGPLETFDKV GYQVSFCTPT GRRPNAIAVS MDPLYIDPPL GRSVTCVAMA KKVAEIDDPS TNQGKRLDTP INLRQWFPER PYWSDSQLVR LMEIYYERLR RAQESLDEFD ALLIVGGSGP IVDLANNQRV HDLILGFYGQ GKPVAAECYG VTCLAFARNI ENKQSIIWGK QVTGHCIEYD YKDGTGFMRS RGQFLDFNMG PPPYPLEYIL RDATGPDGAY IGNFGHPTSV IVDYPFITGR STPDSYLTGQ KLVEVLDGEP PLRRWGW
|
| |