Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3319 |
Symbol | |
ID | 4075724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 327591 |
End bp | 328571 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004827 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_611553 |
Protein GI | 99078295 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0205242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.450742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTT TTTCCCGCAG TCTCCTGGCC GCAGGTGCCA GCTTTGCCTT TGCCGTTCCG CTGGCCGCAC AGACCGAGAT CAAGATTGGC TATGCACTGG CAGAGGACAG CCATTACGGC GTGGCTGCCA AGACATTCGA AGAGGTTGTG CTGGAGCAGA CCGGTGAGGA TTTCAGCTTC ACGCATTTCC CGTCGTCCGG TCTGGGCGGT GAGCGCGATG TGATCGAGGG CCTGCAGCTT GGCACCGTGG AAGTCACCAT CGTGTCTTCC GGCACGCTGG CCAACTTTGT CCCTGAAACT GGTGTTTTCG ACATCCCGTT CCTGTTCCGG GATCTTGGCC ACGCCCGCTC GGTGCTCGAC GGCCCCATCG GTCAGGACAT CCTTGAAAAG TTTGACGCTG TTGGCCTGCA TGCGCTGGCA TGGGGCGAGC AGGGCTTTCG CCATATCACC AACAACCGTA ATGCAATCAA CACTCCTGCC GACGTTCAGG GGCTGAAGCT GCGCACAATG GAAAACCCGG TCCACCTCGC GGCGTTCAAC GCGATGGGCG CCGCGCCGAC ACCGATGGCG TGGCCCGAGG TGATCTCTTC CATGCAGCAA GGGGTGATCG ACGGACAGGA AAACCCGCTC TCGGTGATCG TTTCGGTGAA ACTGGACGAA GTGCAGAAAT ACCTGACCCT CTCCGGTCAC GTTTATTCGC CTGCGATGCT CTTGGTGTCC AAACCCTTCT GGGAAGGTCT GAATGACGAG CAAAAGGCTG CGTTTGAAGC CGCCGCCGCC GAGGCCGTGG GTGCCATGCG CGGATACGTC GATGGCATCG AAGCCAGCGG TGTTGAAACG CTCAAGGAAC GCGGCATGGA AGTGAACGCG CTGAGCGCCG ATGAAAAAGC CGCGTTCCAA GCGTCAATCC AGTCTGCCTA CGAGGGCTAT TACAAGACCT ATGGCGAGGA TCTCGTGAAA TCGATCGTCG CGGCTGAGTG A
|
Protein sequence | MTVFSRSLLA AGASFAFAVP LAAQTEIKIG YALAEDSHYG VAAKTFEEVV LEQTGEDFSF THFPSSGLGG ERDVIEGLQL GTVEVTIVSS GTLANFVPET GVFDIPFLFR DLGHARSVLD GPIGQDILEK FDAVGLHALA WGEQGFRHIT NNRNAINTPA DVQGLKLRTM ENPVHLAAFN AMGAAPTPMA WPEVISSMQQ GVIDGQENPL SVIVSVKLDE VQKYLTLSGH VYSPAMLLVS KPFWEGLNDE QKAAFEAAAA EAVGAMRGYV DGIEASGVET LKERGMEVNA LSADEKAAFQ ASIQSAYEGY YKTYGEDLVK SIVAAE
|
| |