Gene Information |
Locus tag | Smed_2792 |
Symbol | |
ID | 5323662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2911576 |
End bp | 2912613 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640791737 |
Product | regulatory protein LacI |
Protein accession | YP_001328457 |
Protein GI | 150397990 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
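
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
START_BP = 2911576
END_BP = 2912613


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1038 345
```

Under these assumptions the computed values agree with the Gene Length (1038 bp) and Protein Length (345 aa) fields.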
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.527889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCGA CAGTTCACGA TATCGCGGCC GAGGCGGGCG TCAGTCTCGC CACCGTCGAC CGGGTGTTGA ACAACCGCCC GGGCGTGAGA AGCGTCACGC GTAACAGGGT GGAGCGCGCG ATCGCCACGC TCGGCTACGT TCGCGACGTC GCCGCTGCCA ATCTCGCCAA GAGTCGCAGC TATCCCCTGG TCTTCATTCT GCCCTCCGGC GAAAACGCCT TCATGCGCGG TCTCGAGTCC GAGTTGCGCT TCGCGATGTC GCGTTCTGCG GCCGAGCGGC TGGATATCAC CATTCTTTCC GTTCCTGCCT TCGACGCGCC GGCGCTCGCC GCTGCCTTGC TCGACGCGCG CAAGCGCCGG CCCGCGGGCG TTGCCGTCGT CGCCGTCGAA GCGCCCGAAG TGACTGAGGC GGTGAAGCGG CTCTGCGAAG ACGGTGTTTC CGTCGTTACG CTGGTGTCCG ATCTGCCCGG ATCCGGGCGC GATCATTTCG CCGGCGTCGA TAACGTCGCG GCAGGTCGAA CCGCCGGCAG CCTGTTAGGC CGCTTTCTGG GCGGCCGCGA AGGGCCCGTC GCGGTGCTTG CCGGTTCCAT GCTGGTGCGC GACCACCGCG ACCGGCTGGA GGGTTTCCGG GCAGTGACAA GCGAGGATTT CGCCTCCCGA CAGGTTCTTC CGGTCATCGA GGGACAAGAC AACCCGTTGC TCGTCGAGAA GCTCGTAGGC GCGCTGCTCG AGCGGAATCC CGACCTTGCC GGCATCTACA GCCTCGGTGC CGGCAATCGC GGGCTCATAG CCGCACTCGA AAAGGCAGGC AGGGCAAAAT CCGTCTGTAC AATCGCGCAT GAATTGACCC CGCACAGCCG TGCAGCCCTT CTTTCCGGCA CGATCGATGC GCTGCTCAAT CAGAATGCGG GTCACGAAGT CAGGAGCGCC ATCCGGGTGC TGAAAGCAAA GGCGGACGGG CTGCCCGTGA TCGCGGCGCA GGAACGCATC CGCATCGATA TTTTCCTGAA GGACAACCTG CCGCTCGAGC AGGAATAG
|
Protein sequence | MRPTVHDIAA EAGVSLATVD RVLNNRPGVR SVTRNRVERA IATLGYVRDV AAANLAKSRS YPLVFILPSG ENAFMRGLES ELRFAMSRSA AERLDITILS VPAFDAPALA AALLDARKRR PAGVAVVAVE APEVTEAVKR LCEDGVSVVT LVSDLPGSGR DHFAGVDNVA AGRTAGSLLG RFLGGREGPV AVLAGSMLVR DHRDRLEGFR AVTSEDFASR QVLPVIEGQD NPLLVEKLVG ALLERNPDLA GIYSLGAGNR GLIAALEKAG RAKSVCTIAH ELTPHSRAAL LSGTIDALLN QNAGHEVRSA IRVLKAKADG LPVIAAQERI RIDIFLKDNL PLEQE