Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5039 |
Symbol | |
ID | 5319088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1559239 |
End bp | 1560114 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776820 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001313752 |
Protein GI | 150377156 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCC AAATCGTAAG CGCGCTTAAG AGGCGCCGAC ACGGACCGGT CCTGACCGCG TCCAAGCTGC TGCTGGCAAG CTACATCCTG GTTGCCATCG CTGCACCCCT AATCGCGCCG CAAAACCCCT ATGACCCGCT GCAGATCTAC GGCTGGGAGG CGTCCTCGCC ACCCGGTACA CGTGGCAGCG GCGGCTATCT CTACCTGCTT GGAACCGACG GTCTCGGCCG AGACATCGTC AGCACCATTC TCTACGGCCT GCGGATCAGC CTGGTGGTGT CGATCGTCAG TAGCGCGCTC GCGGCGCTGA TCGGGCTGAC CGCCGGCGTC AGTGCCGCAT ATTTCGGGAA ATGGGTCGAC ATCGCGATTA TGCGGCTGGT CGACCTTCAG CTCAGCCTGC CGACGATCCT GATTGCGCTC ATCGCCATCG TCACGCTCGG GCCGGGCATA GACCGTATCA TCCTGGCGTT GATCATCGCG CAATGGGCGA CCTATGCCCG GATCGCCCGC GGCGTCGCAC TCAGCGAAGC GAACAAGCCT TACATGGATG CGGCCCGCTT GATGCGGCTG CCAACGTCCC GGATCATCTT CCGCCACCTG CTTCCGAACA GCATCGCGCC CGCCGTCACC TTGATACCGA TCGAGGTCGG CCATGCCGTC GCACTCGAGG CGACCCTCTC CTTTCTTGGG CTGGGCGTCC CGATCGACAA GCCCTCGCTC GGCTCAGCGG TCGCAAACGG GTTCCAGTAC CTCCTGACCG GCCAATACTG GATCAGCCTG TTCCCGGGCC TCGCCCTCTT CGGCCTGATC GCCAGCATCA ATCTCGTCGG CGAAGACGTT CGCCGCAGCC TCGATCCAAG GAACCATCCG CTATGA
|
Protein sequence | MKAQIVSALK RRRHGPVLTA SKLLLASYIL VAIAAPLIAP QNPYDPLQIY GWEASSPPGT RGSGGYLYLL GTDGLGRDIV STILYGLRIS LVVSIVSSAL AALIGLTAGV SAAYFGKWVD IAIMRLVDLQ LSLPTILIAL IAIVTLGPGI DRIILALIIA QWATYARIAR GVALSEANKP YMDAARLMRL PTSRIIFRHL LPNSIAPAVT LIPIEVGHAV ALEATLSFLG LGVPIDKPSL GSAVANGFQY LLTGQYWISL FPGLALFGLI ASINLVGEDV RRSLDPRNHP L
|
| |