Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2326 |
Symbol | |
ID | 5323187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2407024 |
End bp | 2408664 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640791264 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327993 |
Protein GI | 150397526 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.294702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCACA AGCTCAATGG ACGTTTTCGA ATTCTCGCCG CCTCGGCGGC CCTCGCCATG GCGATGGGCG CGGCTCAGCC GGCCTTCGCG GAAACGCCGA AGGACACGCT GGTCGAGGGT TTCGCATTCG ACGACATCAT CACCATGGAT CCCGGCGAAG CCTTCGAGCT TTCGACCGCC GAGATGACGA GCAATACCTA CAGCCTGCTC GTGAGGCTCG ATCTCAACGA TACCTCCAAG GTCGTAGGCG ATCTTGCGGA AAGCTGGACG GTCTCGGATG ACGGCCTGAC CTATACGTTC AAGCTCAAGC CGGGAATGAA ATTCGCATCC GGCAATCCGA TCACCGCCGA GGACGTCGCC TATTCGTTCG AACGTGCCGT CAAGCTCGAC AAGAGCCCGG CCTTCATCCT CACCCAGTTC GGCCTTACCG GGGACAATGT CACGGAAAAG GCGAAGGCCG CCGACCCGGA GACCTTTGTT TTCACGGTCG ACCAGCCCTA CGCGCCGAGT TTCGTCTTGA ACTGTCTGAC CGCGACGGTT GCTTCCGTAG TCGACAAGAA GCTCGTGCTC GAACATGTGA AATCCGTGTC GCCGAGCGAC GAGTACAAGT ATGACAACGA CTTCGGCAAT GAGTGGCTGA AGACCGGTTA TGCCGGTTCC GGTCCGTTCA AGCTGCGCGA GTGGCGCGCC AATGAAGTCG TGGTGCTGGA ACGCAACGAC AATTATTATG GCGAACCGGC GAAACTCGCC CGCGTCATCT ACCGTCACAT GAAGGAAAGC TCGGGTCAGC GGCTCGCGCT TGAAGCCGGC GACATCGATG TCGCGCGCAA CCTCGAGCCC GGCGACTACG ACGCAGTCGG CAAGAATGCC GATCTGGCGA CGGCCAGCGC CCCGAAGGGA ACGGTCTACT ATATCAGCCT CAATCAGAAG AACGAAAAGC TCGCAAAACC CGAGGTACAG CAGGCGTTCA AGTATCTTGT CGATTACGAC GCGATTGGCT CGACCCTGAT CAAGGGCATC GGCGAGATTC ACCAGAGCTT CCTGCCGAAG GGTGTGCTGG GTGCCGTCGA CGAGAACCCC TACACCTTCG ACGTAGCCAA GGCGAAGGAA CTGCTGGCGA AGGCCGGCTA TCCGGACGGC TTCACCGTTA CGATGGATGT GCGTAATACC CAGCCGGTCA CCGGCATTGC CGAATCCTTC CAGCAGACGC TGGGGCAGGC GGGCGTGAAG CTCGAAATTA TTCCAGGAGA CGGCAAGCAG ACCCTGACCA AGTACCGCGC CCGCAATCAC GACATGTATA TCGGCCAGTG GGGCATGGAT TATTTCGATC CGCACTCCAA TGCCGATACC TTCACCAACA ATCCGGACAA TTCCGACGAA GGCACGAACA AGACGCTCGC CTGGCGCAAC GCCTGGGACG TTCCGGAACT CAGCAAGAAG ACCAAGGACG CGCTCCTCGA ACGCGACAGC ACAAAGCGCG CCGAGATCTA CAAGGAGCTG CAGAAAACGG TGCTCGAGGA CAGTCCTTTC GTCGTCATCT TCCAGCAGAC AGAGGTCGCC GGGTTGCGCG GCATTGTCGA GGGCTTCAAG CTCGGGCCGA GCTTCGACAC CAACTACGTC TGGAACGTCT CCAAGGAATA G
|
Protein sequence | MMHKLNGRFR ILAASAALAM AMGAAQPAFA ETPKDTLVEG FAFDDIITMD PGEAFELSTA EMTSNTYSLL VRLDLNDTSK VVGDLAESWT VSDDGLTYTF KLKPGMKFAS GNPITAEDVA YSFERAVKLD KSPAFILTQF GLTGDNVTEK AKAADPETFV FTVDQPYAPS FVLNCLTATV ASVVDKKLVL EHVKSVSPSD EYKYDNDFGN EWLKTGYAGS GPFKLREWRA NEVVVLERND NYYGEPAKLA RVIYRHMKES SGQRLALEAG DIDVARNLEP GDYDAVGKNA DLATASAPKG TVYYISLNQK NEKLAKPEVQ QAFKYLVDYD AIGSTLIKGI GEIHQSFLPK GVLGAVDENP YTFDVAKAKE LLAKAGYPDG FTVTMDVRNT QPVTGIAESF QQTLGQAGVK LEIIPGDGKQ TLTKYRARNH DMYIGQWGMD YFDPHSNADT FTNNPDNSDE GTNKTLAWRN AWDVPELSKK TKDALLERDS TKRAEIYKEL QKTVLEDSPF VVIFQQTEVA GLRGIVEGFK LGPSFDTNYV WNVSKE
|
| |