Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0039 |
Symbol | |
ID | 4076306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 41146 |
End bp | 42195 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005326 |
Product | dipeptidase AC |
Protein accession | YP_612034 |
Protein GI | 99079880 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.508075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.304593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACTC CGCTTATTTT CGATGGTCAC AACGACCTGC TGCTGCGCCT TCACAATAAA GATGTAGGAA TGGACCAGGC GGGGGTTTTT GGCAGCGGAG GCCGCCAGAT CGACGTTGAC AAAGCCAAGG CCGGTGGGTT CGGCGGCGGA TTCTTCGCCA TCTTTGTGCC TGGCGAAGAG TCGGTCTCCC ATGACGAAGA GATGATGAAG GACACCTATG ACCTGCCGCT TCCAGAGCAG GTCACGTGGC ACAACGCCAT CAAGGTGGCC CTGTCCCAGG CCGCTCTCCT GATCGAGCTC GAAAGGCAGG GCGCGCTGCA GATTTGTCGC TCGACCGCAG AAATTCGCAC GGCGATGGAA CAAGGTCTGA TGGCCGCCGT GATGCATATG GAAGGTGCAG AGGCAATCGA CCGTGATTTC CACACGCTTG ACGTCCTGCA CGGCGCGGGG CTGCGCTCTC TTGGGCCGGT CTGGAGCCGC CCAACCCGCT TTGGCCATGG GGTTCCGTTT CGCTATCCCT CCACCGGGGA CACGGGCGAG GGCCTCACGG AAGATGGGTT TCGCTTGATC AAACGCTGCA ATGAGATGCG GATTATGATC GATCTCTCGC ACATGACGGA AGCCGGTTTT TGGGACGTGG CCCGCGTCAG TGATGCACCT TTGGTGGCGA CCCACTCGAA CGCGGTGGCG CTCACCCGGC ATAGCCGCAA CCTGACCGAC CGCCAGTTGC ATGCGATCCG GGACAGTGAC GGCATGGTCG GGCTGAATTT TGCGGTGGCC TTCCTGCGCG AAGACGGACG CATGGACGAA AACACACCGA TTTCGCGCAT GTTGGATCAT CTCGATTACC TCATCGCAGA GGTTGGCGAG GATCGGGTTG GCATGGGCTC GGATTTCGAC GGGGCAACGG TACCCGCCGA GATCGGCACA ATCGCAGGCC TGCCGGCGTT GCGCCGCGCG ATGCGGGACC GCGGCTATGA CGATGCTTTG ATGAAGAAAC TCTGCCATGA AAACTGGCTC CGAGTCCTGG GCAAGACCTG GGGAGAATAA
|
Protein sequence | MNTPLIFDGH NDLLLRLHNK DVGMDQAGVF GSGGRQIDVD KAKAGGFGGG FFAIFVPGEE SVSHDEEMMK DTYDLPLPEQ VTWHNAIKVA LSQAALLIEL ERQGALQICR STAEIRTAME QGLMAAVMHM EGAEAIDRDF HTLDVLHGAG LRSLGPVWSR PTRFGHGVPF RYPSTGDTGE GLTEDGFRLI KRCNEMRIMI DLSHMTEAGF WDVARVSDAP LVATHSNAVA LTRHSRNLTD RQLHAIRDSD GMVGLNFAVA FLREDGRMDE NTPISRMLDH LDYLIAEVGE DRVGMGSDFD GATVPAEIGT IAGLPALRRA MRDRGYDDAL MKKLCHENWL RVLGKTWGE
|
| |