Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1417 |
Symbol | |
ID | 4078047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1510334 |
End bp | 1512001 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638006727 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_613412 |
Protein GI | 99081258 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.628959 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.153951 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCA CCAAACTCAA GAGCTCGACG ACCCATCATT GGGCGGCTGA GATGCATGCG CAGGAACTGA AGGCGGGCAA GATCTCGCGC CGCGAGTTCC TGACCCGTGC GACAGCGCTT GGTGTCTCGG CCACCGCCGC CTATGGGATG ATCGGCCTGA GCGCCCCTGC CCGTGCGGCG GACAACATGA AAGAAGGCGG CACCCTGCGC GTGCAAATGA GCGTGCGCGG CCTCAAGGAT CCCCGCACCT TCGACTGGTC CGAAATCGGC AACCAGGCGC GTGGCACGCT TGAATACCTG GTTGAATACA ACAACGACGG CTCCTTCACC CCGATGCTCC TGGAGAGCTG GCAGGTGAAT GACAGCGCCG ATGAATACAC CCTGAACGTT CGCAAGGGCG TCAAATGGAA CAACGGCGAC GACTTTACCG CCGAAGATGT GGCCCGCAAC ATCGAAGGTT GGTGCGACAA GGACATGGAA GGCAACTCCA TGGCTGGCCG CTTTGCCACG CTTGTCGATC CCGACACCGG CAAAGCCATC GAGGGCGCGA TCGAGGTGGT AGACAGCCAC ACCGTGAAAC TGAAACTGCC TAACTCCGAC ATTTCCATCA TTCCGGGCAT GGCGGATTAT CCGGCAGCGA TCACCCATGC GAGCTTTAGC GCGGATGATG CGCAAAACGC CGTGGGCACC GGTCCCTATC TGATTTCCGA GCTCGAAGTC GGCGTCAAAT GCGTGCTCAC GCGCAATGAG AACCACAGCT GGTGGGGTGA AGCGATCTAC GGCAAGCCGG CGCTCGACAG CATCGAATAC ATCGATTTCG GCACCGATCC GTCGTCGTGG CTGGCAGCGC TGGAATCCGA AGAAGTCGAT ATGCTCTATG AATCCGTTGG CGAATTCATC GACTTGATGG ACTCGCTTGG CTTCGTGAAA TCCGAAGTTG TCACCATGGC AACCATCGTC GTGCGTCCCA ACCAGCTGGC CGAGGTGGAC GGCATGTCCC CCTATGCCGA CAAACGCGTG CGCAAGGCGC TGCAGATGGC GGTGGACAAT GATGTCTGCC TGGAGCTTGG CTATGGCAAC CGCGGTGTTG CTGCGGAAAA CCATCACGTC GGCCCGGCGC ATCCTGAATA TGCCGAGCTC CCTAAAGAGG CGGCTGATCC CGATGGCGCC CGCGCGCTGA TGGAAGAAGC CGGCATGATG GATTTCGAAC ATGAGCTCTT CTCCATCGAC GATGACTGGC GCAAGAACAC CACCGATGCG GTCGCTGCCC AGCTGCGTGA CGCGGGGTTC AAGGTCAAGC GGACCGTGCT GCCGGGATCG ACCTTCTGGA ATGACTGGAC CAAATATCCG TTCAGCTCGA CCAACTGGAA CCACCGTCCC TTGGGTGTTC AGATCTGGGC GCTGGCCTAC CGTTCAGGTG AGGCCTGGAA CGAATTCGGC TGGTCCAACC CCGAGTTCGA CCAGATCCTG GCCGACGCGC TTGCCATCGC CGATGCCGAC GCGCGCCGCG AAGTCGTCGC CAAAGGTCAG AAGATGATCC AGGACGAGGG TGTCACCATC CAGCCCTATT GGAAATCCCT GTTCCGTCAC ATGCGCGAAG GCGTGCGCGG TGCCGATATG CATATCTCCT ACGAGCACCA TCACTACAAA TGGGGCTGGG CTGCCTGA
|
Protein sequence | MTTTKLKSST THHWAAEMHA QELKAGKISR REFLTRATAL GVSATAAYGM IGLSAPARAA DNMKEGGTLR VQMSVRGLKD PRTFDWSEIG NQARGTLEYL VEYNNDGSFT PMLLESWQVN DSADEYTLNV RKGVKWNNGD DFTAEDVARN IEGWCDKDME GNSMAGRFAT LVDPDTGKAI EGAIEVVDSH TVKLKLPNSD ISIIPGMADY PAAITHASFS ADDAQNAVGT GPYLISELEV GVKCVLTRNE NHSWWGEAIY GKPALDSIEY IDFGTDPSSW LAALESEEVD MLYESVGEFI DLMDSLGFVK SEVVTMATIV VRPNQLAEVD GMSPYADKRV RKALQMAVDN DVCLELGYGN RGVAAENHHV GPAHPEYAEL PKEAADPDGA RALMEEAGMM DFEHELFSID DDWRKNTTDA VAAQLRDAGF KVKRTVLPGS TFWNDWTKYP FSSTNWNHRP LGVQIWALAY RSGEAWNEFG WSNPEFDQIL ADALAIADAD ARREVVAKGQ KMIQDEGVTI QPYWKSLFRH MREGVRGADM HISYEHHHYK WGWAA
|
| |