Gene PCC8801_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4122 
Symbol 
ID7101909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4322321 
End bp4323664 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content38% 
IMG OID643477111 
Productputative Zn-dependent protease 
Protein accessionYP_002374210 
Protein GI218248839 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTAG ATCGCTTAGA ACAGTTAGAA GTAACGTTTA ATCAATTGTC TGAATTTCTG 
ATTGATCAAT TAAATAATGC GGAGCATCTT TCTCTAGAAT TAAGTAGTGA ACAAACCCAA
TTTATCCGCT TTAATAATGC AAAAGTTCGT CAAACAGGAT TGGTTACTGA TGGTAATATT
AAATTGAGTT TTATTGCTAA TCAACGCACT GTTTTTATGA TGTTTCCCTT CACGGGAGAT
CTGACGACAG ATCAACAAAA TGGTCTAGAA AGTCTTAATT ATTTACGTCA AGACATTCTC
CAAGTTCCTG AAGATCCCCA TCTTGTATTA CCTGAAAATA AGGGAACTAC AAGAGAAGTT
TATCGAGGGG ATTTATTAGT TCCAGAAATA GCGGTTAAAA CCCTTCTCCC TGAAGTACAA
AACTTGGATA TGACAGGAAT TTATACCGCA GGACAAGTCA TCCGAGGTAA CGCTAATTCA
GAGGGACAAA ATCATTGGTT TGCTACGGAT TCTTTTTGTT TAGACTATTC TTTAATTGCC
CCTTCAGAAA AAGCAGTCAA AGGGATTTTA TCGGGAAGAA ACTGGGATGA ACAGCAATAT
CAAACTCAAA TAAAATCGTC TCAAAATCAA CTTTTAGCCC TCAATAAATC TCCAAAACAA
ATACAACCTG GGGGCTATCG TACCTATTTT GCACCTGCGG CCACGGCTGA TCTCTTAGGG
ATGCTATCTT GGGGTGCAAT TAGTGAAGCG TCTCTGCGGC AGGGAGGAAG TGCTTTGATG
AAGTTAAAAG AAGGTAAGAC CCTATCTCCT AAGCTTAATT TACAGGAGAA TTTTAGTCTG
GGAAGCGTGC CTAAATTCAA CGAATTGGGT GAAATTTCTC CTGATATTTT GCCTTTAATT
ACTGAAGGAA ACCTAATCAA TACTTTGGTT AATTCTCGGA CAGCTACTGA ATATAAAATT
ACCGCTAATG GAGCCAATTC TTCTGAATCT TTGAGATCCC CTGAATTGGG TAAAGGAACC
TTATCCAGTG AGGATATTTT CAACACATTA GGCACGGGGT TATATCTATC TAATTTACAC
TATTTAAACT GGAGCGATCG CACGGGGGGA AGAATTACGG GAATGACCCG TTATGCCTGT
TTTTGGGTAG AAAATGGCGA AATTGTGGCT CCTATTAAAG ACCTCAGATT TGATGACAGT
CTCTATCGTT TTTGGGGAGA AAATCTTGAA GCATTAACGG ACTTTCAAGA ATTTATTCCT
GAAACCAATA CCTATGAAAG ACGCGAAATA GGAGGCAGTT TAGTCCCTGG AATGTTAGTT
AATGATTTTC AATTTACTTT GTAG
 
Protein sequence
MNLDRLEQLE VTFNQLSEFL IDQLNNAEHL SLELSSEQTQ FIRFNNAKVR QTGLVTDGNI 
KLSFIANQRT VFMMFPFTGD LTTDQQNGLE SLNYLRQDIL QVPEDPHLVL PENKGTTREV
YRGDLLVPEI AVKTLLPEVQ NLDMTGIYTA GQVIRGNANS EGQNHWFATD SFCLDYSLIA
PSEKAVKGIL SGRNWDEQQY QTQIKSSQNQ LLALNKSPKQ IQPGGYRTYF APAATADLLG
MLSWGAISEA SLRQGGSALM KLKEGKTLSP KLNLQENFSL GSVPKFNELG EISPDILPLI
TEGNLINTLV NSRTATEYKI TANGANSSES LRSPELGKGT LSSEDIFNTL GTGLYLSNLH
YLNWSDRTGG RITGMTRYAC FWVENGEIVA PIKDLRFDDS LYRFWGENLE ALTDFQEFIP
ETNTYERREI GGSLVPGMLV NDFQFTL