Gene Information |
Locus tag | TM1040_3220 |
Symbol | |
ID | 4075362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 216964 |
End bp | 217980 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004729 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_611456 |
Protein GI | 99078198 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | |
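The coordinates and lengths listed above are related by simple arithmetic, which a short Python check can make explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 216964
end_bp = 217980

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1017 bp) and Protein Length (338 aa).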
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00317601 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAAT TCTTGGGACT TACCGCTGTT GCATCCGTGA GCTTTGCCTT TGCGGCAGAA GCCATGGCCA CAGAGTGGAA TGTCTCGGTT TGGGGCAAAC GCCGCGCCTT TACCGAGCAC GTCGAAAAGC TCGCTGAACT GGTGTCCGAG AAAACCGACG GCGAATTCAC CATGAACATC AGCTATGGTG GACTGTCCAA AAACCGCGAG AACCTCGACG GGATCTCGAT TGGCGCGTTT GAGATGGCGC AGTTCTGCGC CGGCTATCAC CGCGACAAAA ACCGCGTGAT TACCGTTCTT GAATTGCCCT TCCTGGGCAT TTCCAACCTC GAAGAGGAGG TTGCGGTCTC TAGCGCGGTC TACAACCACC CGGCCGCAGC CGAGGAAATG GCGCAGTGGA ACGCAAAGCT GCTCATGACC TCGCCGATGC CGCAATACAA TATCGTCGGC ACCGGTGATG TGCGTGATGA TCTGGCGGAA TTTGAAGGCA TGCGCGTGCG GGCAACCGGC GGTATCGGCG AAGCCTTCAA GGCTGTTGGC GCCGTTCCGA CCTCCGTCAC CGCGACCGAG GCCTATCAGG CGATGGAATC CGGTGTGGTC GACACCGTAG CGTTCGCACA ACATGCGCAT CTGAGCTTTG GCACCATCAA CCGCGCTGAC TGGTGGACCG CTAACCTCAA CCCCGGCACC GTGAACTGCC CGGTTGTGGT CAATATTGAC GCTTACGAAA GCCTCTCTGA CGCCGAGCGC GAAGCGCTGG ACAGCTCGGT TGCCGAAGCG CTGGATCACT ACCTGGCGAA CTACGGCGAG CTGCTGAAGA AGTGGGATAG TGTTCTCGAG GAAAAAGGCG TCGAAAAGGT CGAGATTTCG GAAGAGGTGC TCGCAGAATT CCGCTCTACT GCGGCTGAGC CGATCCGCGA CGCTTGGATC AAGGATATGG AAGCACAGGG CCTGCCGGGT CAGGAGCTCT ATGATCTAGT TCAGAAAACG CTCGCAGATC ACCGCAACGG CAGCTGA
Protein sequence | MKKFLGLTAV ASVSFAFAAE AMATEWNVSV WGKRRAFTEH VEKLAELVSE KTDGEFTMNI SYGGLSKNRE NLDGISIGAF EMAQFCAGYH RDKNRVITVL ELPFLGISNL EEEVAVSSAV YNHPAAAEEM AQWNAKLLMT SPMPQYNIVG TGDVRDDLAE FEGMRVRATG GIGEAFKAVG AVPTSVTATE AYQAMESGVV DTVAFAQHAH LSFGTINRAD WWTANLNPGT VNCPVVVNID AYESLSDAER EALDSSVAEA LDHYLANYGE LLKKWDSVLE EKGVEKVEIS EEVLAEFRST AAEPIRDAWI KDMEAQGLPG QELYDLVQKT LADHRNGS
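The gene and protein sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the DNA. It is an illustration only: the function names are hypothetical, and the codon table is built by hand rather than taken from a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so the sketch does not handle alternative start codons. This gene starts with ATG, so that limitation does not matter here.

```python
from itertools import product

# Codon table in the usual TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAAAAAT TCTTGGGACT TACCGCTGTT"
print(translate(clean_sequence(first_blocks)))  # MKKFLGLTAV
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the listed protein followed by `*` for the terminal TGA stop codon, and `gc_fraction` on the same string can be compared with the listed GC content.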