Gene Cyan7425_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan7425_4003 
Symbol 
ID7289951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7425 
KingdomBacteria 
Replicon accessionNC_011884 
Strand
Start bp4037075 
End bp4038112 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content57% 
IMG OID643586975 
Productmetalloendopeptidase, glycoprotease family 
Protein accessionYP_002484678 
Protein GI220909367 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00143868 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000399552 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACAG TTTTAGCAAT TGAAACAAGT TGTGATGAAA CCGCCGCAGC AGTTGTAAAG 
AATCGTCATA TTTACAGCAA TGTGATTGCT TCTCAGATTG CCGCCCACCG TCCCTATGGG
GGAGTGGTGC CGGAAGTGGC CTCTCGTCAG CACCTGGAAA ATATCAATGC CGTGATTGAC
GAGGCTCTGG CAGTTGCGGG TCTGGGTTGG GCAGAGATCG ATGCCATTGC CGCCACCTGT
GCGCCCGGAC TGGTCGGTTC CCTGCTGATT GGGTTGACCG CTGCTAAAAC CCTGGCCCTG
GTGCATCAAA AGCCGTTTCT GGGCATTCAT CACCTGGAAG GCCATATCGC TGCCTCCTAC
CTGGCCCATC CCGATTTGCA TCCCCCCTTT CTCTGTCTGC TAGTGTCTGG GGGCCATACC
AGTTTGATCT ACGTCAAAGA TATTGGGGAC TATGAAACCC TGGGGCAAAC ACGGGACGAT
GCCGCCGGAG AAGCCTTTGA TAAGGTGGCC CGCTTGCTGG GGTTGGGCTA TCCCGGTGGG
CCAGTCATCG ATCGCCTCGC CAGTCAGGGC AATCCTGCCG CTTTTCATCT GCCAGAAGGT
AATATTTCCC TACCCGGTGG GGGCTATCAT CCCTACGACT CCAGCTTTAG TGGGCTTAAA
ACCGCCGTTC TGCGTCTGGT GGAGAAACTC AAAACTGAGG GAGACTTACC GATCGCCGAT
CTGGCCGCCA GTTTTCAGGA CTGTGTGGCC CGCTCCCTCA CCCGTCGGAC GATCGCCTGT
GCGCTGGATT ATAGTCTGGA GACGATCGCG ATCGGGGGTG GGGTAGCCGC TAACCGGGGG
TTGAGGTCAC ACCTGCAGGC CGCCGCTGCT GCCCATAATT TAAGGGTGTT ATTTCCGCCC
CTCTCCCTCT GTACAGATAA TGCCGCCATG ATTGCCTGTG CCGCTGCTGC ACATCTGGAA
CGGGGCCATA CCTCGCCCCT CACCCTGGGA GGTCAATCAC GCCTGGCCAT TACAGAGGTG
ATGCAGCTTT ATCAGTGA
 
Protein sequence
MTTVLAIETS CDETAAAVVK NRHIYSNVIA SQIAAHRPYG GVVPEVASRQ HLENINAVID 
EALAVAGLGW AEIDAIAATC APGLVGSLLI GLTAAKTLAL VHQKPFLGIH HLEGHIAASY
LAHPDLHPPF LCLLVSGGHT SLIYVKDIGD YETLGQTRDD AAGEAFDKVA RLLGLGYPGG
PVIDRLASQG NPAAFHLPEG NISLPGGGYH PYDSSFSGLK TAVLRLVEKL KTEGDLPIAD
LAASFQDCVA RSLTRRTIAC ALDYSLETIA IGGGVAANRG LRSHLQAAAA AHNLRVLFPP
LSLCTDNAAM IACAAAAHLE RGHTSPLTLG GQSRLAITEV MQLYQ