Gene Information |
Locus tag | RSP_3661 |
Symbol | |
ID | 3722150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | - |
Start bp | 768699 |
End bp | 769694 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640073334 |
Product | TRAP-T family transporter periplasmic binding protein |
Protein accession | YP_355171 |
Protein GI | 77465668 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
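The coordinate and length fields above are mutually consistent. This can be checked with a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Neither convention is stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(768699, 769694)
print(length)                     # 996
print(protein_length_aa(length))  # 331
```

Both results match the Gene Length and Protein Length fields reported above.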
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGA TCCACGGCCT TCTGGCCGCC GCCCTTCTTG CGACCGGCGC ACAGGCGCAG GACTACAGCA GCCGCACCAT CAAGTTCGCC GCCACCGGTC AGGAAGGCAC GCCTCCGGTG CAGGGCATGC ATATCTTCGC GCAGAAGCTC GAGGAGCAGA GCGGCGGCAA GCTGAAGACG CGCGTCTTCG CCAATGGCGT GCTCGGCGGC GATGTGCAGG TGCTGTCGTC GCTTCAGGGC GGCGTGGTCG AGATGATGGT CTGGAACGCC GGCAACATGA TGACCCAGGC GCAGGATTTC GGCATCCTCG ATCTGCCCTT CATCTATCAG GACGAAGAGG TGATGGATGC GCTGCTCGAC GGCGAGGTCG GCAGGAAGCT CACCGATCAG CTGCCCGAAC ATGGCGTGAT CGGCCTGTCC TTCTGGGAAC AGGGCTTCCG CCAGCTGACC AACGACACCC GCGAGGTGCA CAGGCTCGAG GATATCGCGG GCCTCAAGGT CCGCGTGCAG CAGAACCCGC TTCTCGTCGA CATGTGGCGG GCGCTCGGTG CCAATCCCAC GCCGATGGCG GTGACCGAAC TCTACACCGC GCTCGAGACC GGCGCCGTGG ACGGGCAGGA ATGCACCGCG CCCTTCGCTC TCACCGCGAA ATATACCGAG GTGCAGAAAT ATCTCTCGGT CACCCGCCAC AACTACAATC CGCAGATCGT GCTGATCGGC AAACCCTTCT GGGACAAGCT CACCGACGAT GAAAAGGCCC TGATCCAGAA GGTCGCGCAG GAGACTGCGG TCGAACAGCG CCGCATTTCG CGCGCGGCGC AGGACAGCGC GCTGGAGGAG ATCCGGGCGG CCGGCAATGT CGTGACCGAG ATCACCCCCG AAGAGCTCGC CCGCATGCAG GAGGCCGTCG CCCCGGTCAT CCGCACCTAT GCACAGACCT TCGATCCCGA GCTCGTGCGC ACCGTCTTCG ATGCGGTCGG CTTCTCGCTG GATTGA
|
Protein sequence | MKLIHGLLAA ALLATGAQAQ DYSSRTIKFA ATGQEGTPPV QGMHIFAQKL EEQSGGKLKT RVFANGVLGG DVQVLSSLQG GVVEMMVWNA GNMMTQAQDF GILDLPFIYQ DEEVMDALLD GEVGRKLTDQ LPEHGVIGLS FWEQGFRQLT NDTREVHRLE DIAGLKVRVQ QNPLLVDMWR ALGANPTPMA VTELYTALET GAVDGQECTA PFALTAKYTE VQKYLSVTRH NYNPQIVLIG KPFWDKLTDD EKALIQKVAQ ETAVEQRRIS RAAQDSALEE IRAAGNVVTE ITPEELARMQ EAVAPVIRTY AQTFDPELVR TVFDAVGFSL D
|
| |
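The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch, assuming the blocks only need their whitespace removed, the following shows how a sequence could be rejoined and its GC content computed for comparison with the GC content field. The function names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown in this record.
first_blocks = "ATGAAACTGA TCCACGGCCT"
seq = clean_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 50.0
```

Passing the full gene sequence to the same functions should give a cleaned length that can be compared with the Gene Length field and a percentage that can be compared with the GC content field.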