Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2686 |
Symbol | |
ID | 3910479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3069926 |
End bp | 3070945 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637884586 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_486299 |
Protein GI | 86749803 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTT CGCGCAGGAC ATTGTTGCAG GCATCGGCGG CGGCGGTCGC GCTCGGCGGC ATCGGGGCGC CGTTCGTGGC GCGCGCGGCG GAGGCCGAGT TCGTCTACAA ATACGCCAAC AACCTGCCCG ACACCCATCC GATGAACATC CGCGCCCGCG AGATGGCGGC GGCGATCAAG GCCGAGACCA ATGGCCGGGT CCAGATCGAC ATTTTCCCGA GCAACCAGCT CGGCTCCGAC ACCGACATGC TGAGCCAGAT CCGCTCCGGC GGCGTCGAGT TCTTCACACT GTCCGGCCTG ATCCTGTCGA CGCTGGTGCC GGCGGCCTCG ATCAACGGTA TCGGCTTCGC ATTCCCGGAC TACGACACGG TCTGGAAAGC GATGGACGGC GAGCTCGGCG GCTATGTGCG CGGCGAGATC GGCAAGGCCG GGCTGGTGGT GATGGACAAG ATCTGGGACA ACGGCTTCCG CCAGACCACG ACCTCGACCC GGCCGATCAC CGGCCCGGAC GACTTCAAGG GCCTCAAGAT CCGCGTGCCG GTGTCGCCGC TGTGGACCTC GATGTTCAAG GCGTTCGACG CCTCGCCCGC CTCGATCAAT TTCAGCGAGG TCTATTCGGC GCTGCAGACC AAGGTCGTCG AAGGCCAGGA GAACCCGCTG GCGATCATTT CGACCGCGAA GCTCTATGAA GTGCAGAAAT ACTGCTCGCT GACCAATCAT ATGTGGGACG GCTTCTGGTT CCTCGCCAAC CGCCGCGCCT GGGAACGGCT GCCAGCCGAT CTGCGCGACA TCGTCGCCAG GAACATCAAC GCCGCCGGCG TCAATCAGCG CGCCGACGTC GCCAAGCTCA ACGCCGGCCT GAAGGACGAA CTCGCCACCA AGGGCCTGAC CTTCAACCAG CCGACCATCG GGCCGTTCCG CGACAAGCTG CGCGCCGCCG GCTTCTACGC CGAATGGAAA GGCAAATACG GCGAGCAGGC CTGGTCGCTG CTGGAGAAAT CCGTCGGCAA GCTCGCCTGA
|
Protein sequence | MSVSRRTLLQ ASAAAVALGG IGAPFVARAA EAEFVYKYAN NLPDTHPMNI RAREMAAAIK AETNGRVQID IFPSNQLGSD TDMLSQIRSG GVEFFTLSGL ILSTLVPAAS INGIGFAFPD YDTVWKAMDG ELGGYVRGEI GKAGLVVMDK IWDNGFRQTT TSTRPITGPD DFKGLKIRVP VSPLWTSMFK AFDASPASIN FSEVYSALQT KVVEGQENPL AIISTAKLYE VQKYCSLTNH MWDGFWFLAN RRAWERLPAD LRDIVARNIN AAGVNQRADV AKLNAGLKDE LATKGLTFNQ PTIGPFRDKL RAAGFYAEWK GKYGEQAWSL LEKSVGKLA
|
| |