Gene PCC8801_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1457 
Symbol 
ID7103657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1530335 
End bp1531588 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content43% 
IMG OID643474533 
Productpeptidase M50 
Protein accessionYP_002371670 
Protein GI218246299 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGGAA AATGGCAAAT CGGTTCTTTA TTAGGAATCC CTCTCTATCT TGATCCGTCT 
TGGTTTATTA TTCTCCTGTT TGTGACCTTG GTTAATGCAG CAGAGATTAG CACGCAAAGA
TTAGGGGGAA ATCTGCCAGG TTTGGGATGG TTAGCTGGAT TTATCATGGC TTTATTGCTA
TTTGGGTCGG TTCTGCTTCA CGAATTGGGA CACAGCTTAG CCGCTCGCGC TCAGGGGATT
AAGGTTAATT CAATTACACT ATTTCTCTTT GGGGGAGTGG CCTCGATCGA TCGAGAATCG
AAGACCCCGG TCGGAGCCTT TTGGGTAGCG ATCGCAGGTC CTTTGGTCAG TTTTGGTCTG
TTTATTTTGT TTTTTAGTCT CATTCAATGG GTGAATATTT CCAGCTTTGT CCCATCCGTT
ACTCAAGAGC TTGGGAATAT TAAGTCATTA TTAAGGTATA TGTTGGGAGA TTTAGCCCGA
ATTAATCTGG TTTTAGGGAT TTTTAATTTA ATTCCAGGGT TGCCCCTCGA TGGGGGACAA
ATTTTAAAGG CGATTGTTTG GAAACTAACG GGCGATCGCT TTACGGGGGT TCGTTGGGCA
GCAGCGAGTG GTAAATTAAT TGGTTGGGTG GGAATCTCGA CGGGATTGTT TTTTGTCTTG
ACAACAGGGG GTTTAAGTCC CGTTTGGATC GCGTTGATCG GTTGGTTTGT TCTGCGTAAT
GCTGATACCT ATGATCGCTT GACCGCTTTG CAAGAAAGTT TACTCAAAAT TGTGGCGGCT
GAAGCGATGA GTCATGATTT TCGGGTGATT AATGCTCATC TAACCTTAAA CCAATTTGCT
CAAGAATATA TTCTCAGAGA TTTGAATACG TCTTTAGTGT ATTATGCTGC GTCTGAAGGT
CGTTATCGGG GACTCATTCG TGTTCAAGAT TTACAGTTAA TTGAGCGTTA TCTCTGGGAA
AATCAAACGT TAATCGATAT TGTGCATCCT TTAACGGAGA TTCCTTCCGT TATAGAAAAG
ACTCCCTTAG CAGAGGTAAT TGAAACGCTA GAATCTATTA GCGATCGTTC TGTAACAGTA
TTATCTCCGG CGGGAGCCGT TGCAGGAGTT ATTGATCGCG CAGATATTGT GAAAATTATC
GCTATACGCC ATAATCTTCC GATTCCTGAC AATGAAATCC ATCGGATCAA AGCTGAAGGA
ACCTATCCCC CTTATTTACA ACTCCCTGCG ATCGCTAAAA GTCTTCATGA TTAG
 
Protein sequence
MQGKWQIGSL LGIPLYLDPS WFIILLFVTL VNAAEISTQR LGGNLPGLGW LAGFIMALLL 
FGSVLLHELG HSLAARAQGI KVNSITLFLF GGVASIDRES KTPVGAFWVA IAGPLVSFGL
FILFFSLIQW VNISSFVPSV TQELGNIKSL LRYMLGDLAR INLVLGIFNL IPGLPLDGGQ
ILKAIVWKLT GDRFTGVRWA AASGKLIGWV GISTGLFFVL TTGGLSPVWI ALIGWFVLRN
ADTYDRLTAL QESLLKIVAA EAMSHDFRVI NAHLTLNQFA QEYILRDLNT SLVYYAASEG
RYRGLIRVQD LQLIERYLWE NQTLIDIVHP LTEIPSVIEK TPLAEVIETL ESISDRSVTV
LSPAGAVAGV IDRADIVKII AIRHNLPIPD NEIHRIKAEG TYPPYLQLPA IAKSLHD