Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3121 |
Symbol | |
ID | 6410792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3366555 |
End bp | 3367574 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642713001 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_001992102 |
Protein GI | 192291497 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTT CACGCAGGAG CTTGTTGAAG GCGTCGGCGG CGGCCGCCGC GATCGGCGGC ATCGGTGCGC CTTGGGTGGC GCGCGCCGCT GAGGCCGAAT TCAAGTACAA ATACGCCAAC AACCTGCCCG ACACCCATCC GCTCAACGTT CGCGCCCGCG AGATGTCGGC GGCGATCAAG GCCGAGACCG ACGGCAAGGT CCAGATCGAC GTTTTCCCGA ACAACCAGCT CGGCTCCGAC ACCGACATGC TGAGCCAGAT TCGCGCCGGC GGCGTCGAGT TCTTCACGCT GTCGGGCCTG ATCCTGTCGA CCCTGGTGCC GGCAGCTTCG ATCAACGGCA TCGGCTTCGC ATTCCCGGAC TATGAGACGG TCTGGAAGGC CATGGATGGC GAGCTCGGCG GCTATGTCCG CGGCGAGATC CAGAAGGCCG GCCTGATGGT GATGGACAAG ATCTGGGACA ACGGCTTCCG CCAGACCACG TCGTCGACCA AGCCGATCAA CGGCCCGGAC GACTTCAAGG GCTTCAAGAT TCGCGTGCCG GTGTCGCCGC TGTGGACCTC GATGTTCAAG GCGTTCGATG CGGCCCCCGC CTCGATCAAT TTCAGCGAAG TCTATTCGGC GCTGCAGACC AAGGTGGTCG AAGGCCAAGA GAACCCGCTG GTGCTGATCT CGGCCGCCAA GCTGTACGAA GTCCAGAAGT ACTGCTCGCT GACCAACCAC ATGTGGGACG GCTTCTGGTT CCTGGCCAAC CGCCGCGCCT GGGAAAAGCT GCCGCCGGAC GTGCGCACGA TTGTCGCCAA GCACATCAAC GCGGCTGCGG TAAAGGAACG CGAAGACACC GCCAAGCTCA ACGCCACCGT CAAGGAAGAG CTGACCGCCA AGGGGCTGAT CTTCAATCAG CCGCCGGTGA TGCCGTTCCG CGACAAGCTG CGCAGCGCCG GCTTCTATGC CGAGTGGAAA GGCAAATACG GCGATCAGGC GTGGTCGCTG CTCGAGAAGT CAGTCGGCAA GCTGGCGTAA
|
Protein sequence | MSFSRRSLLK ASAAAAAIGG IGAPWVARAA EAEFKYKYAN NLPDTHPLNV RAREMSAAIK AETDGKVQID VFPNNQLGSD TDMLSQIRAG GVEFFTLSGL ILSTLVPAAS INGIGFAFPD YETVWKAMDG ELGGYVRGEI QKAGLMVMDK IWDNGFRQTT SSTKPINGPD DFKGFKIRVP VSPLWTSMFK AFDAAPASIN FSEVYSALQT KVVEGQENPL VLISAAKLYE VQKYCSLTNH MWDGFWFLAN RRAWEKLPPD VRTIVAKHIN AAAVKEREDT AKLNATVKEE LTAKGLIFNQ PPVMPFRDKL RSAGFYAEWK GKYGDQAWSL LEKSVGKLA
|
| |