Gene PCC8801_1905 details


Gene Information

Locus tag: PCC8801_1905
Symbol:
ID: 7102861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1982061
End bp: 1983626
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 47%
IMG OID: 643474966
Product: ThiJ/PfpI domain protein
Protein accession: YP_002372099
Protein GI: 218246728
COG category: [R] General function prediction only
COG ID: [COG0693] Putative intracellular protease/amidase
TIGRFAM ID:
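
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not count toward the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 1982061
end_bp = 1983626

# An inclusive range covers end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1566 521
```

Both values agree with the Gene Length and Protein Length fields of the record.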


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTATC TACCATTACA AGGCAAAAAA ATCGCTATTC TAGTCAATTC ACAGTATATC 
GCTCAAGAAA TTAAAGGATA CCAAGAAAAA TTTACCGCTT ATGGGGCAAA AGTTGACTTG
ATGTCTCGAC TGTGGGGACA AACTGAGCAA ACCTTCGTCA GTGAAGTGGA ACAAGAAGGA
AAAACCCCCG AAACCCTGAC AGTTTGGATC GATTTTACCC AAGTTAATCT CAATGACTAC
GCCGCCGTCA TTATGGCGGC GAATTATCCC AGTGTGCGGT TACGTTGGCT AAGCGATCAA
GATGCCTCCG GACAACCTAT CAACAACAGT AGTGGTCGTC TTTCCCCTGC GGTACAATTC
ATCTATCAAG CCATGATGAA CCCTAAAATC ATCAAAGGCT TTCCTTGTCA TGCGTTATGG
CTTCTAACCC CTATTCCTGA AGTCTTAGCG GGTCGCAAAG TCACTTGTAA CCGCGTGATG
CTAGGGGATG TTAGTAACGC TGGAGCAATT ATTAGTGAAA CAGCCAGTGG GGTTGTCGTA
GATAGCGATA TCGTGACCAG TGACAGCGAT AGTCACCGAG AAGCGTTTAT TGAGGCGATT
TGTCAACAAA TTCAAGCCGT AGACCAAGGA ACCCTACAAC CCGCTATCAC GGCTGCTACG
ACTCCTTCTG CTAACGTCTC GGTTGAGTCC GTTATTCCCT ATCTACGAGA ACGCAAAATT
TTGATCCTTC TCTCAGAATG GGGTTACTGG GGAGAAGAAT TAGTCGGTCC GTTAGAAACA
TTTGACAAAG TGGGGTATCA AGTATCTTTC TGTACCCCCA CTGGCCGAAG ACCGAACGCG
ATCGCGGTTT CCATGGACCC CCTTTATATC GATCCTCCTC TGGGTCGTTC TGTCACCTGC
GTAGCGATGG CCAAAAAAGT CGCTGAAATT GATGATCCGA GTACCAATCA GGGGAAACGA
CTCGATACCC CGATCAATTT GAGGCAATGG TTTCCCGAAC GTCCCTATTG GTCTGATTCC
CAATTAGTAC GGTTAATGGA GATTTACTAC GAACGCCTCA GACGAGCCCA AGAAAGCCTT
GATGAGTTCG ATGCCTTATT AATTGTCGGG GGTAGTGGTC CTATCGTCGA TTTAGCCAAT
AATCAACGGG TTCACGACTT AATTCTCGGT TTCTATGGAC AAGGCAAACC CGTCGCGGCC
GAATGCTATG GGGTCACTTG TTTGGCTTTT GCTCGCAATA TCGAGAACAA ACAATCGATT
ATTTGGGGTA AGCAGGTCAC AGGACATTGT ATCGAATACG ATTACAAGGA TGGAACTGGG
TTTATGCGAT CGCGCGGTCA ATTCCTCGAT TTCAACATGG GACCCCCACC CTATCCCCTA
GAATACATTC TACGGGATGC TACAGGACCT GACGGAGCTT ATATCGGTAA TTTTGGCCAT
CCCACCAGTG TGATTGTGGA TTATCCCTTT ATTACGGGAC GGTCTACCCC GGATTCCTAT
TTAACGGGAC AAAAACTCGT TGAAGTCCTC GATGGGGAAC CCCCTCTGCG TCGTTGGGGT
TGGTAG
 
Protein sequence
MSYLPLQGKK IAILVNSQYI AQEIKGYQEK FTAYGAKVDL MSRLWGQTEQ TFVSEVEQEG 
KTPETLTVWI DFTQVNLNDY AAVIMAANYP SVRLRWLSDQ DASGQPINNS SGRLSPAVQF
IYQAMMNPKI IKGFPCHALW LLTPIPEVLA GRKVTCNRVM LGDVSNAGAI ISETASGVVV
DSDIVTSDSD SHREAFIEAI CQQIQAVDQG TLQPAITAAT TPSANVSVES VIPYLRERKI
LILLSEWGYW GEELVGPLET FDKVGYQVSF CTPTGRRPNA IAVSMDPLYI DPPLGRSVTC
VAMAKKVAEI DDPSTNQGKR LDTPINLRQW FPERPYWSDS QLVRLMEIYY ERLRRAQESL
DEFDALLIVG GSGPIVDLAN NQRVHDLILG FYGQGKPVAA ECYGVTCLAF ARNIENKQSI
IWGKQVTGHC IEYDYKDGTG FMRSRGQFLD FNMGPPPYPL EYILRDATGP DGAYIGNFGH
PTSVIVDYPF ITGRSTPDSY LTGQKLVEVL DGEPPLRRWG W
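
The sequence blocks above are printed in groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of the record. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the gene starts with ATG.

```python
# Codon table in the conventional TCAG order; translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGTCTTATC TACCATTACA AGGCAAAAAA ATCGCTATTC TAGTCAATTC ACAGTATATC"
print(translate(clean_sequence(first_line)))  # MSYLPLQGKKIAILVNSQYI
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence through the same helpers would allow the Protein Length and GC content fields to be checked in the same way.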