Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3398 |
Symbol | |
ID | 4898289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 452064 |
End bp | 453059 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640113995 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_001045263 |
Protein GI | 126464150 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.586433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGA TCCACGGCCT TCTGGCCGCC GCCCTTCTTG CGACTGGCGC ACAGGCGCAG GACTACAGCA GCCGCACCAT CAAGTTCGCC GCCACCGGTC AGGAAGGTAC GCCGCCGGTG CAGGGCATGC ATATCTTCGC GCAGAAGCTC GAGGAGCAGA GCGGCGGCAA GCTGAAGACG CGCGTCTTCG CCAATGGCGT GCTGGGCGGC GATGTGCAGG TGCTGTCGTC GCTTCAGGGC GGCGTGGTCG AGATGATGGT CTGGAACGCC GGCAACATGA TGACCCAGGC GCAGGATTTC GGCATCCTCG ATCTGCCCTT CATCTATCAG GACGAAGAGG TGATGGATAC GCTGCTCGAC GGCGAAGTCG GCAGGAAGCT CACCGATCAG CTGCCCGAGC ATGGCGTGAT CGGCCTGTCC TTCTGGGAAC AGGGCTTCCG CCAGCTGACC AACGACACCC GCGAGGTGCA CAGGCTCGAG GATATTGCGG GCCTCAAGGT CCGCGTGCAG CAGAACCCGC TGCTCGTCGA CATGTGGAAG GCGCTTGGCG CCAATCCCAC GCCGATGGCG GTGACCGAAC TCTACACCGC GCTCGAGACC GGCGCCGTGG ACGGGCAGGA ATGCACCGCG CCCTTCGCTC TCACCGCGAA ATATACCGAG GTGCAGAAAT ATCTCTCGGT CACCCGCCAC AACTACAATC CGCAGATCGT GCTGATCGGC AAACCCTTCT GGGACAAGCT CACCGACGAT GAAAAGGCCC TGATCCAGAA GGTCGCGCAG GAGACTGCGG TCGAACAGCG CCGCATTTCG CGCGCGGCGC AGGACAGCGC GCTGGAGGAG ATCCGGGCGG CTGGCAATGT CGTGACCGAG ATCACCCCCG AAGAGCTCGC CCGCATGCAG GAGGCCGTCG CCCCGGTCAT CCGCACCTAT GCACAGACCT TCGATCCCGA GCTCGTGCGC ACCGTCTTCG ATGCGGTCGG CTTCTCGCTG GATTGA
|
Protein sequence | MKLIHGLLAA ALLATGAQAQ DYSSRTIKFA ATGQEGTPPV QGMHIFAQKL EEQSGGKLKT RVFANGVLGG DVQVLSSLQG GVVEMMVWNA GNMMTQAQDF GILDLPFIYQ DEEVMDTLLD GEVGRKLTDQ LPEHGVIGLS FWEQGFRQLT NDTREVHRLE DIAGLKVRVQ QNPLLVDMWK ALGANPTPMA VTELYTALET GAVDGQECTA PFALTAKYTE VQKYLSVTRH NYNPQIVLIG KPFWDKLTDD EKALIQKVAQ ETAVEQRRIS RAAQDSALEE IRAAGNVVTE ITPEELARMQ EAVAPVIRTY AQTFDPELVR TVFDAVGFSL D
|
| |