Gene TM1040_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0039 
Symbol 
ID4076306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp41146 
End bp42195 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content60% 
IMG OID638005326 
Productdipeptidase AC 
Protein accessionYP_612034 
Protein GI99079880 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.508075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.304593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACTC CGCTTATTTT CGATGGTCAC AACGACCTGC TGCTGCGCCT TCACAATAAA 
GATGTAGGAA TGGACCAGGC GGGGGTTTTT GGCAGCGGAG GCCGCCAGAT CGACGTTGAC
AAAGCCAAGG CCGGTGGGTT CGGCGGCGGA TTCTTCGCCA TCTTTGTGCC TGGCGAAGAG
TCGGTCTCCC ATGACGAAGA GATGATGAAG GACACCTATG ACCTGCCGCT TCCAGAGCAG
GTCACGTGGC ACAACGCCAT CAAGGTGGCC CTGTCCCAGG CCGCTCTCCT GATCGAGCTC
GAAAGGCAGG GCGCGCTGCA GATTTGTCGC TCGACCGCAG AAATTCGCAC GGCGATGGAA
CAAGGTCTGA TGGCCGCCGT GATGCATATG GAAGGTGCAG AGGCAATCGA CCGTGATTTC
CACACGCTTG ACGTCCTGCA CGGCGCGGGG CTGCGCTCTC TTGGGCCGGT CTGGAGCCGC
CCAACCCGCT TTGGCCATGG GGTTCCGTTT CGCTATCCCT CCACCGGGGA CACGGGCGAG
GGCCTCACGG AAGATGGGTT TCGCTTGATC AAACGCTGCA ATGAGATGCG GATTATGATC
GATCTCTCGC ACATGACGGA AGCCGGTTTT TGGGACGTGG CCCGCGTCAG TGATGCACCT
TTGGTGGCGA CCCACTCGAA CGCGGTGGCG CTCACCCGGC ATAGCCGCAA CCTGACCGAC
CGCCAGTTGC ATGCGATCCG GGACAGTGAC GGCATGGTCG GGCTGAATTT TGCGGTGGCC
TTCCTGCGCG AAGACGGACG CATGGACGAA AACACACCGA TTTCGCGCAT GTTGGATCAT
CTCGATTACC TCATCGCAGA GGTTGGCGAG GATCGGGTTG GCATGGGCTC GGATTTCGAC
GGGGCAACGG TACCCGCCGA GATCGGCACA ATCGCAGGCC TGCCGGCGTT GCGCCGCGCG
ATGCGGGACC GCGGCTATGA CGATGCTTTG ATGAAGAAAC TCTGCCATGA AAACTGGCTC
CGAGTCCTGG GCAAGACCTG GGGAGAATAA
 
Protein sequence
MNTPLIFDGH NDLLLRLHNK DVGMDQAGVF GSGGRQIDVD KAKAGGFGGG FFAIFVPGEE 
SVSHDEEMMK DTYDLPLPEQ VTWHNAIKVA LSQAALLIEL ERQGALQICR STAEIRTAME
QGLMAAVMHM EGAEAIDRDF HTLDVLHGAG LRSLGPVWSR PTRFGHGVPF RYPSTGDTGE
GLTEDGFRLI KRCNEMRIMI DLSHMTEAGF WDVARVSDAP LVATHSNAVA LTRHSRNLTD
RQLHAIRDSD GMVGLNFAVA FLREDGRMDE NTPISRMLDH LDYLIAEVGE DRVGMGSDFD
GATVPAEIGT IAGLPALRRA MRDRGYDDAL MKKLCHENWL RVLGKTWGE