Gene Smed_4961 details


Gene Information

Locus tag: Smed_4961
Symbol:
ID: 5318476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1473587
End bp: 1475953
Gene Length: 2367 bp
Protein Length: 788 aa
Translation table: 11
GC content: 61%
IMG OID: 640776743
Product: exopolysaccharide transport protein family
Protein accession: YP_001313675
Protein GI: 150377079
COG category: [D] Cell cycle control, cell division, chromosome partitioning; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0489] ATPases involved in chromosome partitioning; [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01005] exopolysaccharide transport protein family; [TIGR01007] capsular exopolysaccharide family
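
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Hypothetical helpers; the coordinates are copied from the record above.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1473587, 1475953)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the two values agree with the Gene Length and Protein Length fields listed in the record.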


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.184697
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0547536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGAA CGGCCCCCAT GAAACAAAGA AGCGTTCCCT TGAGCAGCAT CATGCCCAGC 
GAAGAACAGT CCGACGGCTT TATCGACCTT GACCGGCTTG TAGCCGCCGT CTTCCGTCGC
GCCCGGCTCG TGACCGCATT CGTCGTTCTC TTCATCGCAC TGGGGGCCGC TTATCTCCTG
TTTGCCACGC CCTATTACAC GTCGATGACG CAGATCCTTC TCGACGAGAA CCTGTCCAAA
TACGCCGAAG AGGAGCCTGT CCCGGTAAAC AGCCAGATGC TGGATACGCA GATCGCAAGT
GCGGTCGAAA TCCTCAAGTC GGGCGAACTC GCGCTTCGCG TCGTCGACAA GCTCAAGCTT
TCGGAAAACG AAACCATCCT CAATCCGCCG CGCTCGCCCG TCGACGTCGT GAAGGAATGG
CTGAAGACGG CCACCGGACT CTTTGCGGGC GGACCTGAAG TAACCGAGGC GGCAGCGCGC
AACGGCCGGC GGCAGAAAGC TGCCGCGATC ATCCAGCAAT CGCTTGCGGT CGAGCGTGTT
TCGCGAAGCT CGGTCGTCGC CGTGGCTTTT CGCTCGAGCG ATCCGGTGCT TGCGGCGACC
ATCGCGCGTG GATATGCCAG CGCATATCTG ACAGACCAGC TGAACGCCAA TTTCGAAGCT
ACGGAACGCG CATCCGTCTG GCTGCAGGAA CGGCTTACGG ATCTTCAGCA ACGCTCGCAG
GCCGCCGCGC TGGAAGTCGC ACACTACAGG GCCGAGAACG GTTTGACGGC TGCGCGGGGC
GAGTTGATGT CCGAGCAGCA AATGGCCGAC CTCAACAGCC AGCTCATCGT TGCCCAGGCC
GACACGGCAA GTGCCTCCGC CCGCTACAAT CAGTACAAGT CGATCGTCGA CCAGGGGCCG
GAAAACGCAG TGAAGAACGC CACCATCTCC TCCAAGGAGG GTGACAATTC GGTAATCCGC
GACCTGAGGA CGCGCTATCT CACCGTCGGT AAGCGCGAGC GCGAAGTCTC CGACAATTTC
GGCGCCGACC ACCCGCAAGC GGTCTCGCTC CGCACCGAAC AGGAGGATGT GGCCCGTCAG
ATCTACCAGG AACTGCAGCA GCTCACTGCA AGCTATAAGA ACGAATACGA GGTCGCTCAG
TCGCGCGAGG CATCGCTCAG GAAAAGCATC CAGGGGATTG CCGGCAAAAC CTCCGACTCG
AGCGAGCAAC TCGTGCAATT GAGAGAGCTG GAACAGAAAG CCGCGGCCCT GAAGACGCTC
TACGAGTCTT ATCTCGGACG CTACGAACAG GCGACTCAGC AGCAAACCTT CCCGATCGCC
AAGGCTCGTG TCATTTCCGA AGCGGGCGTG CCCGTGTCAC CGTCGAGCCC AAAAAAAACC
ATGACGCTGG CGCTTTCGGC GGTGCTCGGC CTGATGGTCG GCGGTGCATA CGCGGCCTTC
CTCGAATTCC GCGAACGGAC CTTCCGCCTG GAGGGCGACG TACGCTCGAT CCTCGGCCAT
CGTTCGTTCG GCTACGTTCC GCTTCTCGGC ACGCGCATCA AGAAGAAGGC GCAACTCGTT
CACGCACATT TCGGTTCGGT GAAGAGAGCC GACGAAGCGG TGGACAATAC GATGCCGTTC
CAGCGATTGT CGCGCATCGT GGTCGATGCG CCGCGGTCGA CCTTCGCGGA GACCTTTCGC
AACGCCAAGC TTGCCTGCGA CCAGATGCTG GCGGGCAGTG AAAGCCGCGT GATCGCCATC
GCATCGGCCC TTCCGGATGA GGGGAAATCG ATCATTGCCG CAAACTTCGC CGCGCTTCTG
GCCGCGAGCG GCAAGCGGAC CCTGCTCATC GATGCCGACA TACGCAAGCC TGGCCTGACG
CAGATGATTA CGCCTGCCCC GCGCACCGGG CTGGTGGAAA CGCTGACCGG AGAAGCCACC
TGGCCCGCCG GGATCAAGGT GGACCAGCGT ACGAAACTGG CAATCCTTCC GGCCGGCGGT
GCATCGCACC AGCGCCACCA GAGTAACGAG CTGCTTGCCT CGCCGGGCAT GGCGAACCTG
ATCGAGAACG CGCGCAACGC CTTCGACTAT GTCGTCGTCG ACCTTGCGGC GCTCGCCCCC
GTCGTCGACG CGAAAGCCTT TGCGCCGCTG GCCGACGGCA TCCTCTTCGT GGTCGAGTGG
GGAAGAACGC CTTCGCGCCT CGTGCGCGAT CTGCTCAACT CGGAACCGCT GATCAATTCG
AAAGTGCTAG GTGTGATCCT CAACAAGACG GACATGAACG AGCTGGGCAA ATACAGCGAT
TTCGACGGGG CAGAAAAGTA CCGCCACCGC TACGGCAAAT ACTACATAGA AAACAGTTTC
ACGGACAATC AGAACACCGC CGCTTGA
 
Protein sequence
MNRTAPMKQR SVPLSSIMPS EEQSDGFIDL DRLVAAVFRR ARLVTAFVVL FIALGAAYLL 
FATPYYTSMT QILLDENLSK YAEEEPVPVN SQMLDTQIAS AVEILKSGEL ALRVVDKLKL
SENETILNPP RSPVDVVKEW LKTATGLFAG GPEVTEAAAR NGRRQKAAAI IQQSLAVERV
SRSSVVAVAF RSSDPVLAAT IARGYASAYL TDQLNANFEA TERASVWLQE RLTDLQQRSQ
AAALEVAHYR AENGLTAARG ELMSEQQMAD LNSQLIVAQA DTASASARYN QYKSIVDQGP
ENAVKNATIS SKEGDNSVIR DLRTRYLTVG KREREVSDNF GADHPQAVSL RTEQEDVARQ
IYQELQQLTA SYKNEYEVAQ SREASLRKSI QGIAGKTSDS SEQLVQLREL EQKAAALKTL
YESYLGRYEQ ATQQQTFPIA KARVISEAGV PVSPSSPKKT MTLALSAVLG LMVGGAYAAF
LEFRERTFRL EGDVRSILGH RSFGYVPLLG TRIKKKAQLV HAHFGSVKRA DEAVDNTMPF
QRLSRIVVDA PRSTFAETFR NAKLACDQML AGSESRVIAI ASALPDEGKS IIAANFAALL
AASGKRTLLI DADIRKPGLT QMITPAPRTG LVETLTGEAT WPAGIKVDQR TKLAILPAGG
ASHQRHQSNE LLASPGMANL IENARNAFDY VVVDLAALAP VVDAKAFAPL ADGILFVVEW
GRTPSRLVRD LLNSEPLINS KVLGVILNKT DMNELGKYSD FDGAEKYRHR YGKYYIENSF
TDNQNTAA
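
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be cleaned before any computation. The sketch below is one possible way to do that, and to compute GC content and a translation for comparison with the record. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in the start codons it allows, and this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq_block.split()).upper()


def gc_content(seq_block: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq_block)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_block: str) -> str:
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    seq = clean(seq_block)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAACAGAA CGGCCCCCAT GAAACAAAGA AGCGTTCCCT TGAGCAGCAT CATGCCCAGC"
last_row = "ACGGACAATC AGAACACCGC CGCTTGA"
print(translate(first_row))
print(translate(last_row))
```

Translating the first row gives the first twenty residues of the protein sequence, and the last row ends in a stop codon, which is not part of the listed protein. Passing the whole gene sequence to `gc_content` gives a value to compare against the GC content field.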