Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6073 |
Symbol | |
ID | 6983428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | + |
Start bp | 31 |
End bp | 1041 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643399099 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_002283855 |
Protein GI | 209551939 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA CCCGTCGCAA CATGCTGGCA ACAACAGGTG CCTTGGCATT GGGCGCAGCG TTTCACACGC GAGCCGCCGC CGCCGAATTC ACATATAAAT TCGCCAATAA CCTGCCGCCC GATCACCCGG TCAATGTCAA GCTGAAGACG GCCGCCGACA ATATTCTGCA GGAAACGGGC GGTCGGCTGC AGATCAACCT ATTTCCCTCC GGCCAGCTCG GCAACGACAC CGAAACGCTA TCCCAACTGC GTAGTGGCGC GACGGAGTTC TTTTCGCTCT CGCCCTTGAT CCTCTCGACG CTCGCGCCCA ACGCGGCGAT TTCCGGAATG GGTTTCGCCT TTCCCGACTA TGATACGGTC TGGAGGGCCA TGGACGGCAA GCTCGGAGCC TACACGCGCG CGCAAATCGA GAAGAAGGGC ATCGTGCCCA TGGAGAAGAT TTGGGACAAC GGGTTTCGGC AGATCACCAG CTCGTCAAAG ACAATCAACA CGCCTGATGA TTTGAAAGGT TTCAAGATCC GGGTGCCTCC GAGCCCGCTC TGGCAATCCC TGTTCAACGC CTTCGGTGCA GCACCGGTCA CCATCAACGC CGCGGAAATG TACACCGCTC TCTCTACCGG CATCGCGGAC GGACAGGAAA ACGCCCTGAA TGTTGTGGAG GCGTTCAAAC TCTTCGAAGT GCAAAAGTAT TGCGCGATGA CGAGCCACAT GTGGGACGGT TTCTGGCTTC TTGCCAACAA GAACGCCTGG GAGGCGCTGC CGGAAGATGT GCGCGAGATA ACGTCGAAGA ATTTCAACGA CCAGGTCGAC TCCCAGCGGA AAGTCATTGC CGAGCTCAAC ACCTCGTTAC GCGACACGCT GACGAAGCGC GGAATGACGT TCACCGATCC AGACAACGCC GCTTTCAGAG AGAAACTTTC GAAGTCAGGC TTCTACGGCG AGTGGAGGAA AAAGTTCGGC GAAGAAGCGT GGGGAATTCT CGAAGAAGCG GTCGGCGCAC GCCTTGGCTA G
|
Protein sequence | MKITRRNMLA TTGALALGAA FHTRAAAAEF TYKFANNLPP DHPVNVKLKT AADNILQETG GRLQINLFPS GQLGNDTETL SQLRSGATEF FSLSPLILST LAPNAAISGM GFAFPDYDTV WRAMDGKLGA YTRAQIEKKG IVPMEKIWDN GFRQITSSSK TINTPDDLKG FKIRVPPSPL WQSLFNAFGA APVTINAAEM YTALSTGIAD GQENALNVVE AFKLFEVQKY CAMTSHMWDG FWLLANKNAW EALPEDVREI TSKNFNDQVD SQRKVIAELN TSLRDTLTKR GMTFTDPDNA AFREKLSKSG FYGEWRKKFG EEAWGILEEA VGARLG
|
| |