Gene TM1040_1793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1793 
Symbol 
ID4076822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1886267 
End bp1887547 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID638007108 
Productpeptidase M23B 
Protein accessionYP_613788 
Protein GI99081634 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.532041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0360441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCA CGCCGAAGGT CTTCCGCACG ATCCGCCCGA TCCCGCTGCC TTTTCCCCGG 
TCCTCGTCCG CATCGAGTGG CGCATCGCGC CTTCGACCTT CGATTGCCCT ACTGCCGCTG
GTGCTGCTGG CAGCCGCCTG TACCGAGCCG CTCGATTATG ATCTGCGGGG CCAGCTTGGC
GCCTTCAACA CCACAAGGGC CGCGCAGACC GCCACCGAAA ACCGCCCAGC GCCCGACAGC
CGGGGGCTGA TCACCTACCC GTCCTATCAG GTTGCAGTCG CGCGCAATGG CGACACGGTT
GCGGATCTCG CCGGGCGCGT CGGGCTGCCC GCGGTCGAGG TCGCGCGTTT CAACGGCGTG
GAGACCACCG ACCCCCTGCG CAAAGGTGAG GTGCTCGTCC TGCCCCGCCG CGCGCCCGAG
GCCTCTGCGC GCAGTGGCAC CGCCACGCCT GGCGGCGTGG ATATCGCCTC GCTTGCAGGC
AGCGCCATTG ACAGCGCCCC CTCGACCTCG CCCAATCCGG GCTCCGTGAC CACCACCACG
CTGCAGAACA CCCCCAGCAA ACCCGCACCC ACGCGCGTGC AGGCGGGCCC AGAACCCGTG
CGCCACAAGG TGACGCGTGG CGAGACCGCC TATACGATCG CGCGGCTCTA CCAGATCCCG
GTCAAGGCAC TGGCGGAATG GAACGGGCTT GGAAGCGATT TTGCGATCCG CGAAGGCCAG
TACCTGCTGA TCCCGCTCAA AGATCCCAAT GCCCGGCCGC CAAAGGCAGA GACGCAAGAG
GCCGTGACCG CCCCAGGTCA GGGCAGTGCG ACCCCGACCC CGCCCAGCGC CACGCAGCCC
TTGCCGGATG AGGATGTCAA ACCCGCCGCA GAGGCCCGCA CAGAGCCCAC GCCCACGGTC
AAAATCCAAG AGCCCACCCG CGCATCAGAG GCCGCGATGG CCTATCCGGT GACCGGCAAG
ATCATTCGCG CCTACTCCAA GGGCAAGAAC GACGGCATCG ACATCGCCGC CGCCCCCGGC
AGCCCCGTGA AGGCCGCAGA GGCCGGCACC GTCGCCGCGA TCACCGCGGA CTCCAACAAG
GTGCCGATCA TCGTGATCCG TCACGACCGC AATCTTCTGT CGGTCTATGC CAATGTCGAT
GGGATCCGGG TTCAAAAGGG CGATCGGGTC AACCGCGGCC AGAACATCGC CAAGCTGCGC
GGCAGCGCCG AAGAGGCCTA TGTCCACTTT GAAGTGCGCG ACGGCTTTGA GAGCGTCGAC
CCTCTGCCCT ATCTGCAATA A
 
Protein sequence
MTRTPKVFRT IRPIPLPFPR SSSASSGASR LRPSIALLPL VLLAAACTEP LDYDLRGQLG 
AFNTTRAAQT ATENRPAPDS RGLITYPSYQ VAVARNGDTV ADLAGRVGLP AVEVARFNGV
ETTDPLRKGE VLVLPRRAPE ASARSGTATP GGVDIASLAG SAIDSAPSTS PNPGSVTTTT
LQNTPSKPAP TRVQAGPEPV RHKVTRGETA YTIARLYQIP VKALAEWNGL GSDFAIREGQ
YLLIPLKDPN ARPPKAETQE AVTAPGQGSA TPTPPSATQP LPDEDVKPAA EARTEPTPTV
KIQEPTRASE AAMAYPVTGK IIRAYSKGKN DGIDIAAAPG SPVKAAEAGT VAAITADSNK
VPIIVIRHDR NLLSVYANVD GIRVQKGDRV NRGQNIAKLR GSAEEAYVHF EVRDGFESVD
PLPYLQ