Gene TM1040_2639 details


Gene Information

Locus tag: TM1040_2639
Symbol:
ID: 4077942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2773042
End bp: 2774463
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 67%
IMG OID: 638007963
Product: microcin-processing peptidase 1
Protein accession: YP_614633
Protein GI: 99082479
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
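
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2773042
END_BP = 2774463
GENE_LENGTH_BP = 1422
PROTEIN_LENGTH_AA = 473


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed in this record.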


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.818625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGA GCCCTTGCGA TACGGCGGCG CTCTGGCTAG CTCTGGTTCA ATGCCACCCA 
TTTGCGAAGG TGCCCATGAC ACAGACTCCT GAAACGCTTT GCCACGCCCT CCTTGATGCC
GCCCAAAAGG CCGGGGCCGA TTCTGCCGAC GCCATGGCCG CCGAGGGCAG CTCGCTCTCG
ATCGAGGTGC GCGAGGGCGC GCTGGAACAT GCAGAGCGCT CCGAAGGGGT GGACATCGGG
CTGCGGGTCT TTGTCGGCCA GCGTCAGGCG CAGGTGTCCT CCTCCGATAC CCGCCCCGAA
ACCCTGACCG CGATGGCCGA ACGCGCCGTG GCCATGGCCA AAGAAGCGCC CGAAGATCCC
TATGCCGGGC TTGCTGACCC CGCGCAGCTG GCCAAATCCT GGGATCTCGA CGCCCTTGAG
ATGGCCGACC CCAGCGCCGA GCCTGCGCCC GATCAACTGC AACAGGACGC GCTGGCCGCC
GAAAGCGCCT GCGCCGCCAT CGACGGCATT TCTCAGGTCC AGTCCGCCGC GGCGGGCTAT
GGGCGTCATG ACATCCACAT GGCCGCGAGC AACGGGTTCT CCGGGGGCTA TGCGCGCACC
AGCCGCTCGA TCTCCTGTGT GGGGATTGCG GGCACCGGCA CCGGCATGGA GCGCGACTAT
GACGGCGACA GCCGCATCTA TCAAACCGAT CTGCGCAGCG CCGAAGAGAT CGGGCGCACC
GCTGGCGAGC GCGCCATCGA ACGTGTGAAC GCCCGCCGCC CCAAAACTGG CGCCTATCCC
GTGCTCTTTG ACGAGCGGAT CTCCTCATCC CTCATCGGGC ATCTTCTGGG TGCCGCCAAT
GGCGCGTCGG TGGCGCGCGG CTCCTCGTGG CTCAAGGACA GTCTCGGCGC GCAGATCCTG
CCCGAGGCCT TCTCGGTCAT CGAGGACCCC CTGCGCCCCC GCGTTTCAGG CTCGCGCCCC
TTTGATGGCG AAGGCCTGCC CACGCAGCGC CGCGCGATCG TCGACAAGGG CGTGCTGACC
GGCTGGACCA TGGATCTGGC TTCGGCGCGC AAACTTGGCC TTGAGAGCAC AGGCAACGCC
GCGCGTGGCA TCGGGTCGGT GCCGTCGCCC TCCAACTGGA ACATCGCTCT GACCCAGGGG
CAACAGACCC GCGAAGAGCT GCTGCGCGAC ATGGGCACCG GGCTTCTGGT CACCTCGATG
ATCGGCTCCA CCATCAACCC CAACACGGGC GACTACTCGC GCGGCGCTTC GGGCTTCTGG
GTGGAGAACG GCGAGATCCA GTATCCGGTC AACGAGGTCA CGATTGCCGG GAACCTCCTC
GATATGCTGA AAACGCTGGT CGCCGCCAAC GACGCCCGCA CACATCTGTC GCGGGTGGTG
CCATCGCTTC TGGTAGAGGG ACTGACCCTT GCCGGAGAAT GA
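
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC percentage of the kind reported in the GC content field; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence shown above (60 bases).
    first_line = (
        "ATGGCGCAGA GCCCTTGCGA TACGGCGGCG "
        "CTCTGGCTAG CTCTGGTTCA ATGCCACCCA"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_content(first_line), 1))
```

Running `gc_content` on the full 1422 bp sequence, rather than this one line, is what would be comparable to the value listed in the record.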
 
Protein sequence
MAQSPCDTAA LWLALVQCHP FAKVPMTQTP ETLCHALLDA AQKAGADSAD AMAAEGSSLS 
IEVREGALEH AERSEGVDIG LRVFVGQRQA QVSSSDTRPE TLTAMAERAV AMAKEAPEDP
YAGLADPAQL AKSWDLDALE MADPSAEPAP DQLQQDALAA ESACAAIDGI SQVQSAAAGY
GRHDIHMAAS NGFSGGYART SRSISCVGIA GTGTGMERDY DGDSRIYQTD LRSAEEIGRT
AGERAIERVN ARRPKTGAYP VLFDERISSS LIGHLLGAAN GASVARGSSW LKDSLGAQIL
PEAFSVIEDP LRPRVSGSRP FDGEGLPTQR RAIVDKGVLT GWTMDLASAR KLGLESTGNA
ARGIGSVPSP SNWNIALTQG QQTREELLRD MGTGLLVTSM IGSTINPNTG DYSRGASGFW
VENGEIQYPV NEVTIAGNLL DMLKTLVAAN DARTHLSRVV PSLLVEGLTL AGE
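
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table used by NCBI translation table 11 (whose codon-to-amino-acid assignments match the standard code) and translates until the first stop codon. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene begins with ATG. The name `translate` is illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    seq = "".join(raw.split()).upper()
    residues = []
    # Any incomplete trailing codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of the gene sequence shown above.
    print(translate("ATGGCGCAGA GCCCTTGCGA TACGGCGGCG"))  # MAQSPCDTAA
    print(translate("GCCGGAGAAT GA"))  # AGE
```

The two sample outputs correspond to the first ten and last three residues of the protein sequence listed above.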