Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2569 |
Symbol | |
ID | 4895814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2706730 |
End bp | 2707734 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640113168 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_001044443 |
Protein GI | 126463329 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.282286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.691286 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGACCC GTCGCAGCCT CGCCGCTCTG GCAGGCGCCG CCGCGCTGGC GCTCGCCGCC GCCGTGCCGG CTCTCGCCCA GCCGATCGTC ATCAAGTTCA GCCACGTCGT CGCCCCCGAC ACGCCGAAGG GCAAGGGCGC CACGAAGTTC GAGGAACTGG CGGAGAAATA CACCGACGGC GCGGTGGATG TCGAAGTCTA CCCCAACAGC CAGCTCTACA AGGACAAGGA AGAGCTCGAG GCGCTGCAGC TCGGCGCGGT CCAGATGCTC GCCCCGTCGC TGGCCAAGTT CGGCCCGCTC GGCGTGCAGG ATTTCGAGGT CTTCGACCTG CCCTACATCT TCAAGGGCTA TGACGCGCTG CACACCGTGA CCAACGGCGA GGTGGGCAAG ATGCTGTTCT CGAAGCTCGA GGACAAGGGC ATCAAGGGCC TCGCCTACTG GGACAACGGC TTCAAGATCA TGTCGGCCAA CAGCCCGATC GCCACGCCCG ACGACTTCCT CGGGCTGAAG ATGCGCATCC AGTCCTCGAA GGTGCTCGAG GCGCAGATGA ACGCGCTCGG CGCGGTGCCG CAGGTCATGG CCTTCTCCGA GGTCTATCAG GCGCTGCAGA CCGGCGTCGT GGACGGCACC GAGAACCCGC CCTCGAACAT GTATACCCAG AAGATGCACG AGGTGCAGAA GCACGCCACG GTCTCGAACC ACGGCTACCT CGGCTATGCG GTGATCGTGA ACAAGCAGTT CTGGGACGGC CTGCCCGAAG AGGTGCGCGC CGGGCTCGAG AAGGCGCTGA CCGAGGCCAC CGACTATGCC AACGGCATCG CCAAGGAAGA GAACGACAAG GCGCTGCAGG CGATGAAGGA CGCGGGCACG ACCGAGTTCC ACGAGCTGAC CCCCGAAGAG CTCGCGGCCT GGGAAGAGGT GCTCGCCCCC GTCCATGAGG AAATGGCCGG CCGCATCGGC GCCGAGACCA TCGCCGCCGT GAAGGCCGCG ACCGGGACCA ACTGA
|
Protein sequence | MLTRRSLAAL AGAAALALAA AVPALAQPIV IKFSHVVAPD TPKGKGATKF EELAEKYTDG AVDVEVYPNS QLYKDKEELE ALQLGAVQML APSLAKFGPL GVQDFEVFDL PYIFKGYDAL HTVTNGEVGK MLFSKLEDKG IKGLAYWDNG FKIMSANSPI ATPDDFLGLK MRIQSSKVLE AQMNALGAVP QVMAFSEVYQ ALQTGVVDGT ENPPSNMYTQ KMHEVQKHAT VSNHGYLGYA VIVNKQFWDG LPEEVRAGLE KALTEATDYA NGIAKEENDK ALQAMKDAGT TEFHELTPEE LAAWEEVLAP VHEEMAGRIG AETIAAVKAA TGTN
|
| |