Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3222 |
Symbol | |
ID | 4075364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 218740 |
End bp | 220140 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004731 |
Product | TRAP C4-dicarboxylate transport system permease DctM subunit |
Protein accession | YP_611458 |
Protein GI | 99078200 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component |
TIGRFAM ID | [TIGR00786] TRAP transporter, DctM subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00234978 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATA TCGAAATCGG CCTCTGGGTC ACCGCCGGAA TGATGGTGCT TGTTGTGCTG GGCATGCGGG TTGCGTTTGC CGCCGGACTT GCGGGCTTTG TGGGTCTTGT GTGGCTTCGG TGGAACGGCT TTGACTATAA CCCGGAGCGT TTCTGGAAAG CTGTTGAAAT CAGTGTAAAA ATTGCGGGCC AAGTACCGCA CTCCAAGGTT TCGAGTCAGG CGCTCAGCCT CATCCCGACC TTTATTTTAA TTGGATACCT CGCCTACTAC GCCCGCCTCA CCACGGCACT GTTTGAGGCC GCAAAGCGCT GGGTTGCCTG GGTGCCCGGG GGCCTTGCGG TATCCACCGT TTTTGCCACC GCAGGGTTTG CCGCCGTTTC GGGCGCATCG GTTGCAACCG CTGCGGTCTT TGCCCGGATC GGCATCCCGG AAATGCTGGC GGTTGGCTAC AACAAGCGTT TTGCAGCCGG GGTGGTCGCC GCAGGCGGCA CCCTCGCCTC TCTGATCCCG CCCTCCGCCA TTCTCGTGAT CTATGCAATC ATTGTGGAGC AGGACGTGGG CAAGCTCTTG CTGGCCGGCT TTGTGCCCGG TGCGTTTTCG GCCGTGGTCT ATGCGGGCTT GATCATCGCC ATCGCGATGA TCTTCAAGAC GGTCGGCCCA CCAGTCACGG GCTTCACCTG GCGCGAACGC CTGGTGTCCT TGCCACCTGC CCTGCCGATT TTTGCGGTTG TCGTGATCAT CATCTTCTTT GTCTACAACC CGTTTGGCGA AGCCTGGGGG ACACCAACCG AGGGCGGTGC TGTCGGCGCC TTCATCGTCT TTCTCGTCGC GCTAATGCGC GGGATGCGGA TGCGCGAACT GCTGGATGCG CTGCTTGAGA CTGCCAAACT CACGATTATG ATCTTCACCA TCATCTGGGG TGTTTTGATC TACGTGCGCT TTCTGGGCTT TGCAGATTTG CCTTCGGCCT TCGCCGACTG GATCTCGACC CTCTCGGCCT CGCCCATGCT GATCCTAATC TGTATTCTGC TTGCCTATGC GGTTCTTGGG ATGTTCATGG ACGCCATCGG CATGCTTCTT CTGACGCTGC CTGTTGTCTA CCCGGCCGTC ATGGCGTTGA ACGGAGGCGA GATGGTCTCG GCAGCGGATT CTGCCTTCGG CATGTCCGGC CCAATGTGCG CGATCTGGTT TGGCATCCTG GTGGTCAAGA TGGCGGAGTT CTGTCTGATC ACGCCCCCAA TCGGGCTGAA TTGCTTTGTT GTAGCGGGGG TACGAGACGA TTTGTCAGTA CAAGACGTGT TCCGCGGCGT GATCCCGTTC TTCATCGCAG ACGCTGTGAC AATCGCGCTC TTGGTGGCCT TTCCTACCAT TGTCTTGTGG CTCCCAAGCT TGGCAGGCTA G
|
Protein sequence | MTDIEIGLWV TAGMMVLVVL GMRVAFAAGL AGFVGLVWLR WNGFDYNPER FWKAVEISVK IAGQVPHSKV SSQALSLIPT FILIGYLAYY ARLTTALFEA AKRWVAWVPG GLAVSTVFAT AGFAAVSGAS VATAAVFARI GIPEMLAVGY NKRFAAGVVA AGGTLASLIP PSAILVIYAI IVEQDVGKLL LAGFVPGAFS AVVYAGLIIA IAMIFKTVGP PVTGFTWRER LVSLPPALPI FAVVVIIIFF VYNPFGEAWG TPTEGGAVGA FIVFLVALMR GMRMRELLDA LLETAKLTIM IFTIIWGVLI YVRFLGFADL PSAFADWIST LSASPMLILI CILLAYAVLG MFMDAIGMLL LTLPVVYPAV MALNGGEMVS AADSAFGMSG PMCAIWFGIL VVKMAEFCLI TPPIGLNCFV VAGVRDDLSV QDVFRGVIPF FIADAVTIAL LVAFPTIVLW LPSLAG
|
| |