Gene Information |
Locus tag | TM1040_1820 |
Symbol | |
ID | 4076966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1914075 |
End bp | 1915658 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638007135 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_613815 |
Protein GI | 99081661 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
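The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for TM1040_1820.
length_bp = gene_length(1914075, 1915658)
print(length_bp)                  # 1584
print(protein_length(length_bp))  # 527
```

Both results agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields in the record.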
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.575663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.77509 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACT ATCTTAAACA CCTCGCACAG GAAGCCGCAT TGGGCCGCAT GGGCCGCCGC GAGTTCCTCG GCCGCGCTGC TGCGCTGGGT GTGACCGCAT TGGGCGCGAA CTCGCTGCTG GCCTCTGCGG CCAAAGCGGC AGGCGCGCAA AAAGGCGGCA CCCTGCGTGT GGGTCTGCAA GGGGGCTCCA CCACCGACAG CCTCGACCCG GCTCTGGCGA CAAACCAGGT GGCCCTCTCG GTGATCCGTC TCTGGGGTGA GCCGCTGGTG GAACTGGGCG AAGAAGGCGG CCTCGTGGGT GCCGTCGCCG AAAGCTTTGA AGCCTCGGCA GACGCAAAAA CCTGGACCTT CAAGCTGCGC TCCGGCCTGA CCTTCTCCAA CGGCCAGCCG GTGACGGCGG CGGATGTTGT CGCCACCATG GAGCGTCACT CCGGTGAGGA CACAAAATCC GGTGCGCTTG GCATCATGCG CTCCATTTCC AACGTCAAGG CCGATGGCGA TACGGTGGTC TTTGAGCTCG AAGACGCAAA CGCCGACCTG CCCTACCTGC TGACCGACTA CCACCTGGTG ATCCAGCCCA ACGGCGGCAA GGACGATCCG GCGGCCGCGA TCGGCACCGG TCCTTATGTT CTGAAAGCCG TGGACATGGG CGTGCGCTTT GTGGCCGAGA AGAACCCCAA CTACTGGGGC GATCTGGGCA ATGCGCAGAC CATCGAGTAC GTGGTCATCA ACGACAATAC CGCCCGTGTG GCTGCACTGC AGGCCGGTCA GGTGGACATG ATCGACCGCG TGCCGCCGCG CACCGCGAAA CTGGTGGATC GCGCGCCGAA CATCTCCGTG CACTCCACTG CGGGTCCGGG CCACTATGTG TTCATCGCCC ATTGCGACAC CGATCCCTTT GCCAACAACG ACGTGCGCCT CGCACTGAAA TACGGCATCA ACCGTCAGGA AATGGTCGAC AAGATCCTGA ACGGCTTTGG CTCCGTGGGC AACGACTCGC CGATCAACGC CTCCTACCCG CTGTTCACCC AGCTGGAGCA GCGCGAGTAC GATCCCGAGA AGGCCAAGTT CCACATGGAA AAATCGGGCT ATGACGGTCC GATCCTGCTG CGCACCTCCG ACAACTCCTT CCCCGGAGCC CCGGATGCGG CGGCGCTGTT CCAGCAGTCG CTGGCAGCGG CAGGCATCAA CCTCGAGATC AAGCGTGAGC CCAACGATGG CTACTGGTCC GAGGTCTGGA ACAAGCAGCC CTTCTGCACC TCCTACTGGG GTGGCCGTCC GACGCAGGAC TCCATGTTCT CCACCGCCTA TCTGTCGACC GCAGACTGGA ACGACACCCG TTTCAAGAAC GAGCAGTTCG ACCAGACACT GCTGGCGGCC CGTGGCGAGC TCGACGAAGC CAAGCGCACC CAGATGTATG CAGATATGTC GACCATGGTG CGCGACGAAG GCGGCCTGAT CTGCCCGATG TTCAACGACT TTGTCGATGC AACGTCGGAT CGTCTGGACG GCTGGAAAGA CGGCGTCAAA GGTCACGCTC TCATGAACGG TTACGCGCCC CTGAAAATGT GGGTCAAAGC CTGA
|
Protein sequence | MNDYLKHLAQ EAALGRMGRR EFLGRAAALG VTALGANSLL ASAAKAAGAQ KGGTLRVGLQ GGSTTDSLDP ALATNQVALS VIRLWGEPLV ELGEEGGLVG AVAESFEASA DAKTWTFKLR SGLTFSNGQP VTAADVVATM ERHSGEDTKS GALGIMRSIS NVKADGDTVV FELEDANADL PYLLTDYHLV IQPNGGKDDP AAAIGTGPYV LKAVDMGVRF VAEKNPNYWG DLGNAQTIEY VVINDNTARV AALQAGQVDM IDRVPPRTAK LVDRAPNISV HSTAGPGHYV FIAHCDTDPF ANNDVRLALK YGINRQEMVD KILNGFGSVG NDSPINASYP LFTQLEQREY DPEKAKFHME KSGYDGPILL RTSDNSFPGA PDAAALFQQS LAAAGINLEI KREPNDGYWS EVWNKQPFCT SYWGGRPTQD SMFSTAYLST ADWNDTRFKN EQFDQTLLAA RGELDEAKRT QMYADMSTMV RDEGGLICPM FNDFVDATSD RLDGWKDGVK GHALMNGYAP LKMWVKA
|
| |
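The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own pipeline: it assumes plain A/C/G/T input, relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and does not handle alternative start codons such as GTG or TTG. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAACGACT ATCTTAAACA CCTCGCACAG"
print(translate(first_blocks))  # MNDYLKHLAQ
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.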