Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2000 |
Symbol | |
ID | 4077457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2103846 |
End bp | 2105762 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007315 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_613994 |
Protein GI | 99081840 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACCT TGAACCGCAG AAATCTGTTG AAGGGCAGCG TTGCGGCGGC CTGTTTGACG GTGCCCGCCG TGCGCGCAGT TGGGCAGACG AACTCGCTCT CGGAGATCGG CAAGTTGTTT GCACCGGTGC CTCGGGCAAA GATCTTCACC GCGCGCGATA TTGTCACGCT TGACCCTTCT CAACCCTCGG CCGAGGCCAT CGCGGTTGTC GGCACGCGAA TCCTCGCGGT GGGCGCGCTC AGCGAGGTGC AGGAGATGCT CGGGGATCAA CCCTTTGATG TGGATGACAG TTTCGCCGAC AAGGTCATCG TGCCAGGGTT TATCAGCCAG CATGATCATC CGGTTCTGGC GGCGCTGTCG ATGTCGTCCG AGATCCTGTC GATCGAGGAG TGGGCCTTGC CGACGGGCAC CGTGCCTGCG GTCAAGGACA AGGCGGACTT CATGAAACGC CTCACGGCAG CCGTGGAGGC GCGTACCACC CCCGGAGAGC CGGTGGTGAC CTGGGGGTAT CATCCCGCCT TTTACGGTCC TTTGACACGC GCGGATCTCG ACGGGATCAG CACCGAGCGC CCCATTCTGG TGTGGGGTCG CTCCTGCCAC GAGATGATCC TCAATAGCGC CGCCCTGACG GCCGGTGGGG TGACACAGGC CGCGGTGGAG GCCTTCGACG CGGCCTCGCA GAAACAGGCC AACCTTGCGG AGGGCCATTT CTGGGAGCAG GGGCTTTTTG CGGTTCTGCC TCATATTGCG TCTTTGGTGG CCACGCCCGA ACAGCTGCGC GCTGGGCTCG AACTCAGCCG TGACTTCATG CACAGCAAGG GCATCACCTT TGGCAACGAG CCCGGCGGCA TTCTTGCAAA ACCCGTGCAG GACGGAGTCA ATGCGGTGTT TTCCAGCCCG GATATGCCGT TTCGCTGGTC GTTCATGGTC GATGCCAAGA GCATGGTCGC CAGTTACGCC GATGACGGCG AGGTCATTGC CCGGTCAGAG GCGCTTCAAT CCTGGTACTA CGGGATGACA AGCCTCGCGC CGCGCCAGGC CAAACTGTTT TCGGATGGGG CGATCTATTC GCAGCTCATG CAGGTGCGCG CGCCCTATCT CGACGATCAC CACGGCGAGT GGATGATGGA GAAAGAGCTG TTTGAGCGCG CGTTCAGGGT CTACTGGGAT GCGGGGTATC AGCTGCACAT CCATGTCAAC GGTGATGCCG GGTTGGATCG TGTGCTGGAG ACGCTCGAGA CCAACATGCG TCGCAATCCG CGTTTTGATC ACCGTACGGT CATCGTGCAT TTTGCCGTCA GCGCCTTTGA CCAGGTGGAG CGGATCAAGG CGCTCGGGGC CATCGTGAGC GGCAATCCCT ATTATGTCAC GGCGCTTGCG GATCAGTATT CCGAAGTGGG TCTTGGCGCC GAGCGGGCAG ACAGCATGGT GCGTCTCGGG GATCTCTCGC GGGCCGGGGT GCGTTGGTCG CTCCATTCGG ATATGCCGAT GGCGCCAGCT GACCCGTTGT TCCTGATGTG GTGCGCGGTG AACCGGGTGA CCACGTCGGG CCGGGTGGCG GCGCCGGAAC AGGCGGTGTC CGCCGAGGAC GCGCTGCGCG GCGTCACCAT CGAAGCGGCC TATTCGCTGC AGATGGAAGA AGAAATCGGC AGCCTCGTGC GTGGCAAGCG GGCGAATATG ACGATCCTTG CCGAGAACCC GCTCGAGGTG GATCCCATGG CGATCCGCGA GATCGAGGTC TGGGGCACGG TGATGGAGGG GCGGGTGCTG CCGGTCCGTT CGAGTGATCG AGCCGCAGCA ACGCAGCAGG ACGCCTCGGC TGGTCCGCGT GAGGCGGTTT CACCAAATGG AGCGCCAGCC TTTGACAAAG CTGCCCTTGA GCATGCCCTG AAGGTCACGC ACGCGCATCA TCTCTGA
|
Protein sequence | MRTLNRRNLL KGSVAAACLT VPAVRAVGQT NSLSEIGKLF APVPRAKIFT ARDIVTLDPS QPSAEAIAVV GTRILAVGAL SEVQEMLGDQ PFDVDDSFAD KVIVPGFISQ HDHPVLAALS MSSEILSIEE WALPTGTVPA VKDKADFMKR LTAAVEARTT PGEPVVTWGY HPAFYGPLTR ADLDGISTER PILVWGRSCH EMILNSAALT AGGVTQAAVE AFDAASQKQA NLAEGHFWEQ GLFAVLPHIA SLVATPEQLR AGLELSRDFM HSKGITFGNE PGGILAKPVQ DGVNAVFSSP DMPFRWSFMV DAKSMVASYA DDGEVIARSE ALQSWYYGMT SLAPRQAKLF SDGAIYSQLM QVRAPYLDDH HGEWMMEKEL FERAFRVYWD AGYQLHIHVN GDAGLDRVLE TLETNMRRNP RFDHRTVIVH FAVSAFDQVE RIKALGAIVS GNPYYVTALA DQYSEVGLGA ERADSMVRLG DLSRAGVRWS LHSDMPMAPA DPLFLMWCAV NRVTTSGRVA APEQAVSAED ALRGVTIEAA YSLQMEEEIG SLVRGKRANM TILAENPLEV DPMAIREIEV WGTVMEGRVL PVRSSDRAAA TQQDASAGPR EAVSPNGAPA FDKAALEHAL KVTHAHHL
|
| |