Gene Cphamn1_2127 details


Gene Information

Locus tag: Cphamn1_2127
Symbol:
ID: 6375821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 2304158
End bp: 2305405
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 52%
IMG OID: 642684617
Product: peptidase U32
Protein accession: YP_001960516
Protein GI: 189501046
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
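
The Start bp, End bp, Gene Length and Protein Length fields can be checked against each other with simple arithmetic. The sketch below is illustrative only and is not part of the record; it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, both of which are consistent with the reported lengths.

```python
# Values copied from the Gene Information fields above.
start_bp = 2304158
end_bp = 2305405

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1248 0 415
```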


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00706152
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACAGAC ACCGTAAAAT CGAGCTCATT TCACCTGCCG GAGACCACAC ATCGCTTCTT 
GCCGCCCTTC AGGCGGGAGC AGATGCCGTC TATTTCGGTG CGGAGGGATA TAATATGCGG
GCGGCCAGCA GAAGTTTCAC GCCCGATGAT TTTCCGACTG TCTCCGGCCT TTGCGCGACC
TATGGCGCAA AAGCCTACCT GGCACTCAAT ACCGTGATAT ACGATGAAGA ACTGCCGGAT
GTTCAAAAGA CGGTCCGGGC AGCAAAAGCT GGTGGTCTCG ACGCGATCAT CTGCTGGGAC
CAGTCGGTTA TAGAAGCGTG TCGGGAAGCC GGGATGCCCT TTCATCTCTC AACGCAGGCA
TCAGTCAGCA ATTACCGCGC GGTACGCTAC TATGCCTCGC TTGGCGCGGG AATGATCGTA
CCCGCCCGTG AACTGACCCT TGAACAGATC ATAAAGATCA CCGAAAGAAT CCGCCTGGAA
AAACTGGACG TAGCCATCGA ATGCTTTGTT CATGGCGCCA TGTGTATGGC CGTGTCGGGA
AGATGCTTTC TCTCGCAGGA CATCTTCGGG CGTTCAGCCA ACCGCGGCGC ATGCATGCAG
CCCTGCAGAC GCCGTTACAG GATCATCGAT AGTGATGACG GTCATGAACT GGATCTCGGG
ACAGATACCG TGATGAGCCC TGAAGACCTT TGCACCATTT CGTTCATTGA CAAACTCATC
GATGCAGGCA TAACCGGCTT CAAGATAGAA GGCCGGAACC GAAGTCCTGA ATATGTCCAT
ACTACAACGA AATGCTACCG CAAGGCCATC GACTACACTC TCGAACATGG ACACGAAAAA
CAGTTCAGAC GCCATTTTGA AGCTCTGGCG AAAGAACTGG CCACGGAACT TCACAAGGTC
TACAACCGCG GATTTTCACA TGGATTTTAC CTTGGCGTTC CTGTTGATTC ATGGACACAG
CAGTACGGGT CTCTTGCCAC GGAAAAAAAA GTGTATGCAG GTACTGTGCA GAAATACTAC
CCTAAAGCAA AGGTGGCGGA AATCCTGATA CACACCAGAG GAATACACTC GGAAGAAAAA
CTCTCGATAC AGGGAACAAC AACCGGACTG GTTGTTCTCA ACGTCCAGTC GATGCGGGTT
AACGATCAGC CTGCTCTCTC GGCATCAAAA GGAGATATTG CGACAATCCC CTGCGATAAA
AAAGTCAGGA AAAACGACAA GGTGTATGTG CTGGAAGCTG CGGAATAA
 
Protein sequence
MDRHRKIELI SPAGDHTSLL AALQAGADAV YFGAEGYNMR AASRSFTPDD FPTVSGLCAT 
YGAKAYLALN TVIYDEELPD VQKTVRAAKA GGLDAIICWD QSVIEACREA GMPFHLSTQA
SVSNYRAVRY YASLGAGMIV PARELTLEQI IKITERIRLE KLDVAIECFV HGAMCMAVSG
RCFLSQDIFG RSANRGACMQ PCRRRYRIID SDDGHELDLG TDTVMSPEDL CTISFIDKLI
DAGITGFKIE GRNRSPEYVH TTTKCYRKAI DYTLEHGHEK QFRRHFEALA KELATELHKV
YNRGFSHGFY LGVPVDSWTQ QYGSLATEKK VYAGTVQKYY PKAKVAEILI HTRGIHSEEK
LSIQGTTTGL VVLNVQSMRV NDQPALSASK GDIATIPCDK KVRKNDKVYV LEAAE
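
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to build this record. The names `CODON_TABLE` and `translate` are made up for illustration. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted; since this gene starts with ATG, the sketch applies no special start-codon handling.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon (hypothetical helper).

    Whitespace is ignored, so the 10-base blocks used on this page
    can be pasted in directly. A single trailing stop codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 30 bases and last 48 bases of the gene sequence above.
first_block = "ATGGACAGAC ACCGTAAAAT CGAGCTCATT"
last_line = "AAAGTCAGGA AAAACGACAA GGTGTATGTG CTGGAAGCTG CGGAATAA"

print(translate(first_block))  # MDRHRKIELI
print(translate(last_line))    # KVRKNDKVYVLEAAE
```

Both outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` works the same way.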