Gene Information |
Locus tag | PCC8801_1905 |
Symbol | |
ID | 7102861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1982061 |
End bp | 1983626 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643474966 |
Product | ThiJ/PfpI domain protein |
Protein accession | YP_002372099 |
Protein GI | 218246728 |
COG category | [R] General function prediction only |
COG ID | [COG0693] Putative intracellular protease/amidase |
TIGRFAM ID | |
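
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions agree with the values listed in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon (assumed to be part of the CDS) encodes no amino acid.
    return codons - 1 if includes_stop else codons


# Values taken from the record for PCC8801_1905.
length = gene_length_bp(1982061, 1983626)
print(length)                     # 1566
print(protein_length_aa(length))  # 521
```

Both results match the Gene Length (1566 bp) and Protein Length (521 aa) fields.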
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTATC TACCATTACA AGGCAAAAAA ATCGCTATTC TAGTCAATTC ACAGTATATC GCTCAAGAAA TTAAAGGATA CCAAGAAAAA TTTACCGCTT ATGGGGCAAA AGTTGACTTG ATGTCTCGAC TGTGGGGACA AACTGAGCAA ACCTTCGTCA GTGAAGTGGA ACAAGAAGGA AAAACCCCCG AAACCCTGAC AGTTTGGATC GATTTTACCC AAGTTAATCT CAATGACTAC GCCGCCGTCA TTATGGCGGC GAATTATCCC AGTGTGCGGT TACGTTGGCT AAGCGATCAA GATGCCTCCG GACAACCTAT CAACAACAGT AGTGGTCGTC TTTCCCCTGC GGTACAATTC ATCTATCAAG CCATGATGAA CCCTAAAATC ATCAAAGGCT TTCCTTGTCA TGCGTTATGG CTTCTAACCC CTATTCCTGA AGTCTTAGCG GGTCGCAAAG TCACTTGTAA CCGCGTGATG CTAGGGGATG TTAGTAACGC TGGAGCAATT ATTAGTGAAA CAGCCAGTGG GGTTGTCGTA GATAGCGATA TCGTGACCAG TGACAGCGAT AGTCACCGAG AAGCGTTTAT TGAGGCGATT TGTCAACAAA TTCAAGCCGT AGACCAAGGA ACCCTACAAC CCGCTATCAC GGCTGCTACG ACTCCTTCTG CTAACGTCTC GGTTGAGTCC GTTATTCCCT ATCTACGAGA ACGCAAAATT TTGATCCTTC TCTCAGAATG GGGTTACTGG GGAGAAGAAT TAGTCGGTCC GTTAGAAACA TTTGACAAAG TGGGGTATCA AGTATCTTTC TGTACCCCCA CTGGCCGAAG ACCGAACGCG ATCGCGGTTT CCATGGACCC CCTTTATATC GATCCTCCTC TGGGTCGTTC TGTCACCTGC GTAGCGATGG CCAAAAAAGT CGCTGAAATT GATGATCCGA GTACCAATCA GGGGAAACGA CTCGATACCC CGATCAATTT GAGGCAATGG TTTCCCGAAC GTCCCTATTG GTCTGATTCC CAATTAGTAC GGTTAATGGA GATTTACTAC GAACGCCTCA GACGAGCCCA AGAAAGCCTT GATGAGTTCG ATGCCTTATT AATTGTCGGG GGTAGTGGTC CTATCGTCGA TTTAGCCAAT AATCAACGGG TTCACGACTT AATTCTCGGT TTCTATGGAC AAGGCAAACC CGTCGCGGCC GAATGCTATG GGGTCACTTG TTTGGCTTTT GCTCGCAATA TCGAGAACAA ACAATCGATT ATTTGGGGTA AGCAGGTCAC AGGACATTGT ATCGAATACG ATTACAAGGA TGGAACTGGG TTTATGCGAT CGCGCGGTCA ATTCCTCGAT TTCAACATGG GACCCCCACC CTATCCCCTA GAATACATTC TACGGGATGC TACAGGACCT GACGGAGCTT ATATCGGTAA TTTTGGCCAT CCCACCAGTG TGATTGTGGA TTATCCCTTT ATTACGGGAC GGTCTACCCC GGATTCCTAT TTAACGGGAC AAAAACTCGT TGAAGTCCTC GATGGGGAAC CCCCTCTGCG TCGTTGGGGT TGGTAG
Protein sequence | MSYLPLQGKK IAILVNSQYI AQEIKGYQEK FTAYGAKVDL MSRLWGQTEQ TFVSEVEQEG KTPETLTVWI DFTQVNLNDY AAVIMAANYP SVRLRWLSDQ DASGQPINNS SGRLSPAVQF IYQAMMNPKI IKGFPCHALW LLTPIPEVLA GRKVTCNRVM LGDVSNAGAI ISETASGVVV DSDIVTSDSD SHREAFIEAI CQQIQAVDQG TLQPAITAAT TPSANVSVES VIPYLRERKI LILLSEWGYW GEELVGPLET FDKVGYQVSF CTPTGRRPNA IAVSMDPLYI DPPLGRSVTC VAMAKKVAEI DDPSTNQGKR LDTPINLRQW FPERPYWSDS QLVRLMEIYY ERLRRAQESL DEFDALLIVG GSGPIVDLAN NQRVHDLILG FYGQGKPVAA ECYGVTCLAF ARNIENKQSI IWGKQVTGHC IEYDYKDGTG FMRSRGQFLD FNMGPPPYPL EYILRDATGP DGAYIGNFGH PTSVIVDYPF ITGRSTPDSY LTGQKLVEVL DGEPPLRRWG W
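
The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute a GC fraction, and translate codons. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names are hypothetical, and only the first three blocks and the final six bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = strip_blocks(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


head = "ATGTCTTATC TACCATTACA AGGCAAAAAA"  # first 30 bases of the gene sequence
tail = "TGGTAG"                            # last 6 bases of the gene sequence

print(translate(head))  # MSYLPLQGKK
print(translate(tail))  # W*
```

The first ten residues agree with the start of the listed protein sequence, and the final codon is a stop, so the protein ends at the preceding W. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.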