Gene TM1040_2313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2313 
Symbol 
ID4078303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2434344 
End bp2436134 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content66% 
IMG OID638007635 
Productpeptidoglycan binding domain-containing protein 
Protein accessionYP_614307 
Protein GI99082153 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACC TATTTGCCTC GGTGTTGTGG GCAGTGATCC TGTGCTTGAG TGCGCTATCG 
GGCGCCTCGC AGGCGCAGCA ATCCAGTGCC GGAGGCGTCT GGATTCAGGT GGCCGCGCGC
CCGTCGCTGC AACAGGCACA GGAAGAGGCG CGCAATTTCG CGGCCCGCCT GCCCGATGTG
TCGGGCTATG CCTTGGGCGG CGGCTGGTAT GGCATCGTGA TTGGCCCCTA TGCTCGCCCC
GATGCCGAGC AGGTGTTGCG GGTCTATCGC GCTGAGGGCC AGATCCCGCG CGACAGCTTC
ATCACCTTCT CAAGCAATCT GCGCAGCCGG TTTTACCCCG TTGGAGCGGA CGCCGACACA
GGCACCACCG CCCCTGCGCC GCTGACCACC TCCGATGCCG AGACCGCGCC GCCAGAGCCA
GCGGAGGACG CCACGTCTGA GGTGACGCCC GAGGTGGTCG TGGTGGATGA GACCCCCGCC
GAAGCGCGCC AGAGCGAACG CGCCCTCACA CGCGAAGAAC GCATGGATCT GCAAATCGCG
CTGAAGGCGG CAGGGTTCTA CACCTCCGGC ATCGACGGTG CCTTTGGCCG CGGCACGCGC
GCCTCCATGT CTGACTGGCA GGTGGCACGC GGGTATGAGC CGACCGGCGT TCTGACCACT
GCCCAGCGCC AGGCCCTGAT GGACGAGTAT AACGCGCCGC TGATTTCCAC CGGCATGGCG
TCTTATACTG ACGAGCGCGC GGGCATCCGC ATGGATCTGC CCTTGGGCGA AGTGGCCTTC
AGCCGTTATG AGCCGCCTTT TGCGCATTTT GACACGGCCG GAGATCTCGG CGCTCGGGTT
CTGCTGATTT CGCAGCCCGG CGACCGCCGC ACGCTCTATG GTCTCTATGA CATCATGCAG
ACGCTGGAGA TTGTGCCGCT CGAGGGACCA CGCGAACGCA GCGGCGACCG CTTCACGCTC
GAAGGCCGCA ACAGCGACAT CGTCTCTTAT ACCGAGGCCC GCCTCGACAA TGGCGAGATC
AAAGGCTTCA CGCTGGTCTG GCCCGCAGGG GACGAGGCGC GCCGCGCCCG CGTTCTTTCG
GCCATGCAAG AGAGCTTTAC CCGTCTGGAG AACGTGCTCG ATCCGGCGGC CGGCATGGAT
GATGCGCAGT CCATCGACCT TGTGTCGGGG CTGGAGATCC GCAAACCGCG CCTGTCGCGC
TCGGGTTTCT TTGTGTCGGG CGACGGTGTG GTTGTGACCA CCGCCGATGT GGTGAACGGC
TGCGGCAGCG TCACAGTAGA CCGCGATGTG AAGGCCGAGG TGCTGTTGCG CGATAGCGAG
GCAGGCATTG CAGTGCTGAA ACCTTCCGAG GCACTGGCCC CGATGGCGAC TGCGCCGCTT
GCGGCCAACA CGCCGCGCCT GCGCAGCGCG CTTGCCGTGT CGGGCTACTC CTACGGCGGA
ATCCTTGGCG CACCAAGCCT CACATGGGGC GCGCTCAGCG ATCTCAAGGG GCTGCAAGGC
GAAAGCCATC TGGCGCGTCT GGACCTGACC GCCCAGCCCG GTGATGCGGG CGGGCCGCTT
CTTGGCAAGG ATGGCACCGT GCATGGCATG CTGCTGCCGG AGGCCACCAA TGGCCCCACC
CTGCCCGAGG GCGTGCGCTT TGCCCTAAAG GGCGAGGTCA TCCGCATGGC GCTCGAACAG
GCTGGCGTGG GCTTGCCCGC GACCGAGGCC GCCGCCACCG GCGAGCTGCC CATCTCGATC
CTGCAAAAAG AGGCCACTGG CCTCACCACG CTGGTCAGCT GCTGGGAGTA A
 
Protein sequence
MKNLFASVLW AVILCLSALS GASQAQQSSA GGVWIQVAAR PSLQQAQEEA RNFAARLPDV 
SGYALGGGWY GIVIGPYARP DAEQVLRVYR AEGQIPRDSF ITFSSNLRSR FYPVGADADT
GTTAPAPLTT SDAETAPPEP AEDATSEVTP EVVVVDETPA EARQSERALT REERMDLQIA
LKAAGFYTSG IDGAFGRGTR ASMSDWQVAR GYEPTGVLTT AQRQALMDEY NAPLISTGMA
SYTDERAGIR MDLPLGEVAF SRYEPPFAHF DTAGDLGARV LLISQPGDRR TLYGLYDIMQ
TLEIVPLEGP RERSGDRFTL EGRNSDIVSY TEARLDNGEI KGFTLVWPAG DEARRARVLS
AMQESFTRLE NVLDPAAGMD DAQSIDLVSG LEIRKPRLSR SGFFVSGDGV VVTTADVVNG
CGSVTVDRDV KAEVLLRDSE AGIAVLKPSE ALAPMATAPL AANTPRLRSA LAVSGYSYGG
ILGAPSLTWG ALSDLKGLQG ESHLARLDLT AQPGDAGGPL LGKDGTVHGM LLPEATNGPT
LPEGVRFALK GEVIRMALEQ AGVGLPATEA AATGELPISI LQKEATGLTT LVSCWE