Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4961 |
Symbol | |
ID | 5318476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1473587 |
End bp | 1475953 |
Gene Length | 2367 bp |
Protein Length | 788 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776743 |
Product | exopolysaccharide transport protein family |
Protein accession | YP_001313675 |
Protein GI | 150377079 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.184697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0547536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAA CGGCCCCCAT GAAACAAAGA AGCGTTCCCT TGAGCAGCAT CATGCCCAGC GAAGAACAGT CCGACGGCTT TATCGACCTT GACCGGCTTG TAGCCGCCGT CTTCCGTCGC GCCCGGCTCG TGACCGCATT CGTCGTTCTC TTCATCGCAC TGGGGGCCGC TTATCTCCTG TTTGCCACGC CCTATTACAC GTCGATGACG CAGATCCTTC TCGACGAGAA CCTGTCCAAA TACGCCGAAG AGGAGCCTGT CCCGGTAAAC AGCCAGATGC TGGATACGCA GATCGCAAGT GCGGTCGAAA TCCTCAAGTC GGGCGAACTC GCGCTTCGCG TCGTCGACAA GCTCAAGCTT TCGGAAAACG AAACCATCCT CAATCCGCCG CGCTCGCCCG TCGACGTCGT GAAGGAATGG CTGAAGACGG CCACCGGACT CTTTGCGGGC GGACCTGAAG TAACCGAGGC GGCAGCGCGC AACGGCCGGC GGCAGAAAGC TGCCGCGATC ATCCAGCAAT CGCTTGCGGT CGAGCGTGTT TCGCGAAGCT CGGTCGTCGC CGTGGCTTTT CGCTCGAGCG ATCCGGTGCT TGCGGCGACC ATCGCGCGTG GATATGCCAG CGCATATCTG ACAGACCAGC TGAACGCCAA TTTCGAAGCT ACGGAACGCG CATCCGTCTG GCTGCAGGAA CGGCTTACGG ATCTTCAGCA ACGCTCGCAG GCCGCCGCGC TGGAAGTCGC ACACTACAGG GCCGAGAACG GTTTGACGGC TGCGCGGGGC GAGTTGATGT CCGAGCAGCA AATGGCCGAC CTCAACAGCC AGCTCATCGT TGCCCAGGCC GACACGGCAA GTGCCTCCGC CCGCTACAAT CAGTACAAGT CGATCGTCGA CCAGGGGCCG GAAAACGCAG TGAAGAACGC CACCATCTCC TCCAAGGAGG GTGACAATTC GGTAATCCGC GACCTGAGGA CGCGCTATCT CACCGTCGGT AAGCGCGAGC GCGAAGTCTC CGACAATTTC GGCGCCGACC ACCCGCAAGC GGTCTCGCTC CGCACCGAAC AGGAGGATGT GGCCCGTCAG ATCTACCAGG AACTGCAGCA GCTCACTGCA AGCTATAAGA ACGAATACGA GGTCGCTCAG TCGCGCGAGG CATCGCTCAG GAAAAGCATC CAGGGGATTG CCGGCAAAAC CTCCGACTCG AGCGAGCAAC TCGTGCAATT GAGAGAGCTG GAACAGAAAG CCGCGGCCCT GAAGACGCTC TACGAGTCTT ATCTCGGACG CTACGAACAG GCGACTCAGC AGCAAACCTT CCCGATCGCC AAGGCTCGTG TCATTTCCGA AGCGGGCGTG CCCGTGTCAC CGTCGAGCCC AAAAAAAACC ATGACGCTGG CGCTTTCGGC GGTGCTCGGC CTGATGGTCG GCGGTGCATA CGCGGCCTTC CTCGAATTCC GCGAACGGAC CTTCCGCCTG GAGGGCGACG TACGCTCGAT CCTCGGCCAT CGTTCGTTCG GCTACGTTCC GCTTCTCGGC ACGCGCATCA AGAAGAAGGC GCAACTCGTT CACGCACATT TCGGTTCGGT GAAGAGAGCC GACGAAGCGG TGGACAATAC GATGCCGTTC CAGCGATTGT CGCGCATCGT GGTCGATGCG CCGCGGTCGA CCTTCGCGGA GACCTTTCGC AACGCCAAGC TTGCCTGCGA CCAGATGCTG GCGGGCAGTG AAAGCCGCGT GATCGCCATC GCATCGGCCC TTCCGGATGA GGGGAAATCG ATCATTGCCG CAAACTTCGC CGCGCTTCTG GCCGCGAGCG GCAAGCGGAC CCTGCTCATC GATGCCGACA TACGCAAGCC TGGCCTGACG CAGATGATTA CGCCTGCCCC GCGCACCGGG CTGGTGGAAA CGCTGACCGG AGAAGCCACC TGGCCCGCCG GGATCAAGGT GGACCAGCGT ACGAAACTGG CAATCCTTCC GGCCGGCGGT GCATCGCACC AGCGCCACCA GAGTAACGAG CTGCTTGCCT CGCCGGGCAT GGCGAACCTG ATCGAGAACG CGCGCAACGC CTTCGACTAT GTCGTCGTCG ACCTTGCGGC GCTCGCCCCC GTCGTCGACG CGAAAGCCTT TGCGCCGCTG GCCGACGGCA TCCTCTTCGT GGTCGAGTGG GGAAGAACGC CTTCGCGCCT CGTGCGCGAT CTGCTCAACT CGGAACCGCT GATCAATTCG AAAGTGCTAG GTGTGATCCT CAACAAGACG GACATGAACG AGCTGGGCAA ATACAGCGAT TTCGACGGGG CAGAAAAGTA CCGCCACCGC TACGGCAAAT ACTACATAGA AAACAGTTTC ACGGACAATC AGAACACCGC CGCTTGA
|
Protein sequence | MNRTAPMKQR SVPLSSIMPS EEQSDGFIDL DRLVAAVFRR ARLVTAFVVL FIALGAAYLL FATPYYTSMT QILLDENLSK YAEEEPVPVN SQMLDTQIAS AVEILKSGEL ALRVVDKLKL SENETILNPP RSPVDVVKEW LKTATGLFAG GPEVTEAAAR NGRRQKAAAI IQQSLAVERV SRSSVVAVAF RSSDPVLAAT IARGYASAYL TDQLNANFEA TERASVWLQE RLTDLQQRSQ AAALEVAHYR AENGLTAARG ELMSEQQMAD LNSQLIVAQA DTASASARYN QYKSIVDQGP ENAVKNATIS SKEGDNSVIR DLRTRYLTVG KREREVSDNF GADHPQAVSL RTEQEDVARQ IYQELQQLTA SYKNEYEVAQ SREASLRKSI QGIAGKTSDS SEQLVQLREL EQKAAALKTL YESYLGRYEQ ATQQQTFPIA KARVISEAGV PVSPSSPKKT MTLALSAVLG LMVGGAYAAF LEFRERTFRL EGDVRSILGH RSFGYVPLLG TRIKKKAQLV HAHFGSVKRA DEAVDNTMPF QRLSRIVVDA PRSTFAETFR NAKLACDQML AGSESRVIAI ASALPDEGKS IIAANFAALL AASGKRTLLI DADIRKPGLT QMITPAPRTG LVETLTGEAT WPAGIKVDQR TKLAILPAGG ASHQRHQSNE LLASPGMANL IENARNAFDY VVVDLAALAP VVDAKAFAPL ADGILFVVEW GRTPSRLVRD LLNSEPLINS KVLGVILNKT DMNELGKYSD FDGAEKYRHR YGKYYIENSF TDNQNTAA
|
| |