Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0512 |
Symbol | |
ID | 4077218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 538014 |
End bp | 539153 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638005808 |
Product | peptidase M20D, amidohydrolase |
Protein accession | YP_612507 |
Protein GI | 99080353 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.252109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGACA CGACGCATAT CGAAGAGGCC ATCGCCCTGC GCCATGACCT TCATCGCCAC CCAGAGTTGG GGTTCGAAGA GCATCGCACC TCCGAGATCA TCGCGGGCCT TTTGCACGGC TGGGGCTGGC GTGTGCATCG TGGTCTCGCA GGGACAGGGG TTGTTGCGCA GATGGGGCAG GGGGAACCGG TCATAGGTCT GCGCGCCGAT ATTGATGCAT TGCCGATGGA GGAGGCTACA GGGCTGGCCT ATGCCTCGGG GACGCCCGGC AAGATGCATG CCTGCGGTCA TGATGGTCAC ACCGCGATGT TGCTGCTGGC TGCGCGCAAG ATCGCCGAAG AGGGCGTGGC CACAGGCACG GTAACGCTGA TCTTCCAACC CGCAGAGGAG AATGATGGCG GCGCACGGGT CATGATTGAA CAGGGGCTGT TTCGCGAGTT TCCGGTCGAT CAGGTCTATG GCATTCACAA CTGGCCGGGG CTCGCGCCAG GGCGCATGGT GGCGCGTGAC GACAAGATGA TGGCGGCCTT TGCCGTCTTT GAGATCGAGG TTGCCGGGCA GGGTGGGCAT GGCGCAATGC CGGAGCAATC GGATGGGGTC ATCGCGGCTG CAGCGGCGAT GGCGTCGACG TTGCAGGAAA TTCCTGCCCG CGCTTTGTCG CCGCTGGAGC CGGGCGTGGT CTCGGTGACG CAGATCCATT CCGGATCGGC GTGGAATGTC TGCCCTGACC GTGCGGTCTT GCGCGGGACC GCGCGCTGGT TCGATCCCGC TGCTGGCGAT ACGATTGAGG CCCGTCTGAC ACAGGTGGCG AACGCCTGCG CCGCCGCACA GGGGTGCAGC GCGCGTATTG ATTACCAGCG TCGCTATCCC GCGACGATCA ACTCTGCTGT CGAGGCCGCT GCGGCGCGCG CGGTGGCCGC CGAGATGGGG CTTGAGACAG CCAATGTGGC ACCGAGCATG GCATCAGAGG ATTTTGCCTT TATGCTCAAC GAGGTGTCCG GCGCCTATAT CTGGCTTGGA GCGGCGCGCG ACGGAGAAAA TCCGGGTCTG CACTCAGCCA AATTTGATTT CAATGATGCC GTGCTCCCAG TCGGGGCCGA GTTCTGGGTG CGGCTGGCCC GCCGCCAGCT GGCGACCTGA
|
Protein sequence | MFDTTHIEEA IALRHDLHRH PELGFEEHRT SEIIAGLLHG WGWRVHRGLA GTGVVAQMGQ GEPVIGLRAD IDALPMEEAT GLAYASGTPG KMHACGHDGH TAMLLLAARK IAEEGVATGT VTLIFQPAEE NDGGARVMIE QGLFREFPVD QVYGIHNWPG LAPGRMVARD DKMMAAFAVF EIEVAGQGGH GAMPEQSDGV IAAAAAMAST LQEIPARALS PLEPGVVSVT QIHSGSAWNV CPDRAVLRGT ARWFDPAAGD TIEARLTQVA NACAAAQGCS ARIDYQRRYP ATINSAVEAA AARAVAAEMG LETANVAPSM ASEDFAFMLN EVSGAYIWLG AARDGENPGL HSAKFDFNDA VLPVGAEFWV RLARRQLAT
|
| |