Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3571 |
Symbol | |
ID | 4898378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 661683 |
End bp | 662663 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640114180 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_001045434 |
Protein GI | 126464321 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.303866 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCGA AAACAAAGGC CGGTCTCGGC CTGACGGTCG GACTCATGCT CATGGCCGGC ACGGCGACCG CGCGGACCTT GACGCTGGGC ACCGTCTACG GAGCCCGCGA CGTCAGCACC CAGGCCATGG AACATTGGAA CGAGGCGCTG AGCGAGGCCA CGGAGGGGCG GTGGTCGCTG TCGATCGTGC CGGGCGGCAC CCTCGGCGGA GACCGCGAGA TGCTTCAGCA GCTCTCGACC GGCGAGATCG ACATCAATCT CTCCTCGCCC GTGGTCATGC AGTATGTTGC GCCGCAATAT CAGTGCCTTG AGGCGGAATA TATCTACGAT TCCGAAGAGC AGGGCTTCGC TGTGTGGCGC GGGGACATCG GCAAGGCCGC CTCGCAGGCC ATGAAGGACG CCCATGGCAT CGAGATCGCC GCCGTCGGCC GCCGGGGCGC GCGCCTCGTG ACGGCGAACA AGCCGATCCT GAAGCCGGAA GATCTGGCGG GCCTGAAGTT CCGCGTCACC AACAACCTCA GGTCCGAGGT CTTCGCCGCC TATGGCGCAC AGCCCGCGCC GCTTCCCCTG TCGGAGCTCT ATGGCGCGCT GCGTCAGGGC GTGTTCGATG CGCAGGAGAA CCCGCTCTCC ACGATCTTCA GCCTGCGCTT CCACGAGGTT CAGAGCCACA TCAGCGAGAC CAACCACATC TGGACCTACA ATCTGGTGCT GACCAACAGC GCCCTGATGG ACGAACTGGG CGAGGATCGC GCCGCGTTCG AAAGCACGCT GGCCCAGTCG CTGGAGTGGC TCTACACGGC CATCGACGAA GAGAATGCCC GGATCCGGGC CGAGATCGAG GCCTCGGGCT CGGCCGTCTT CGACAAGCCC GACACGCAGG CCTTCCGCGA CGCCGCCCGC CCCATCCTCG CGGCCTATGC CGAGGAAAGC TGCGCGCCGG GGCTGCTCGA TGCGGTCGAC GCCGTGGCTG CATCGAACTG A
|
Protein sequence | MNAKTKAGLG LTVGLMLMAG TATARTLTLG TVYGARDVST QAMEHWNEAL SEATEGRWSL SIVPGGTLGG DREMLQQLST GEIDINLSSP VVMQYVAPQY QCLEAEYIYD SEEQGFAVWR GDIGKAASQA MKDAHGIEIA AVGRRGARLV TANKPILKPE DLAGLKFRVT NNLRSEVFAA YGAQPAPLPL SELYGALRQG VFDAQENPLS TIFSLRFHEV QSHISETNHI WTYNLVLTNS ALMDELGEDR AAFESTLAQS LEWLYTAIDE ENARIRAEIE ASGSAVFDKP DTQAFRDAAR PILAAYAEES CAPGLLDAVD AVAASN
|
| |