Gene Smed_5211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5211 
Symbol 
ID5319513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp173424 
End bp175541 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content63% 
IMG OID640776989 
Producthypothetical protein 
Protein accessionYP_001313921 
Protein GI150377326 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCC CGATTGTTGC ATCCATAGGT GTTGTGACCG CCGCGCTTGC CACCGCCGGG 
CTCGCCGGAG ACGGGGAGTT GCCCAGGGGA TTGGCGCAGG TGATCGTCGA TTCGGATCGA
GCCGATGACG CATCAACTGC TGTCCAACTC TCACAGGCTC ACGGACCGGC TGCCCCTGAG
AGCACGCAAG ATCCGCCGCC TTTGCGCGTC GATGAATCAG CCTTGCGCTA CTTCGCCAGC
AAGGGCGATA CGGCTCGTCT GGAGGCCGAG ATCGCCCGCC TGCGGGCGCT CTATCCAAAC
TGGACGCCGC CGCAAGATCC GCTGGCGGTG CCGCAGAACA GCGATGCTCA ATTGGAGACG
ATGTGGCGGC TCTATTCCGA GGGGCGCTAT GCGGAAGTGC GAAGGGCCAT AGCCGAACGT
CAGTCGACCG ATCCGGATTG GCAGGTTCCA ACCGATTTGC TTGAGCGGCT GAATGTCGCC
GAGGCACGTG CCAGACTCGT CAATGCGTCC GATCTGAAGC AATACGAAAC CGTAATCCAG
GTAGGAGCAG GCACGCCGAG TCTTCTGACG TGCAGCGACA ATGACGTTCT TTGGCGTGTG
GCCGAAGCCT TTGCTGAGAC CAAGAGGCCT AACCGGGCGC GCGACGCATA CCTCTACATT
CTCAAGAACT GCGACAACGA GCCCGAACGG CTCGCTACCG TGCAGAAGGC TGCCTCTAAT
CTTTCCTATT CGAGCATGCA GGATCTGCTC TCCTATGAGA GAACGAATTC CGCGGGGGCG
CTGGAGTTCG AGAGCATTCG CGACGATCTC GCCCGCCGCT TTGTCGCCGC AGGCGATGAG
GACGCGACGC TTGAGGTGGA TCCGAAGTAT CTCCAACGTG TCGAGCGGCT GGCGGAAACG
GAAGGGTCCG CCGCGGATGC GCTGCTCCTT GGCTGGTACC AACTTCGCCG CAAGAACACG
AGTGAGGCGG AACGCTGGTT CCGTCTGGCG CGCGACAAGC AGGACTCTGC GCCGGCATCA
CAGGGCCTTG CGCTCGTCTT GATCGAGCGC AAGGCACCCG AGGAAGCCGA AAAGGTGCTC
TACCCATGGC GGGACGCCTC CAGTGATGCG CGAGCGACCT ACTTCGCCGC GACCGCCAAC
CTCCTGGCGA TAGATCCGCC CGTGGCTCTT AGCGCAGATG TTCTTCAAAG GATTGCACAG
GAGACGATGA AGAGTCGCGA TGCCGCGACA GCGCAGCAAT TCGGTTGGTA CGCGCGTCTC
CTTGGGCAGC CAGCCACGGC TGTGCAGTGG TTTTCGACCA CGCTTGGCTG GAAGCCGGAC
GACGAGCCCT CCGCCTATGG CCTTTCAATC AGTCACAAGG AACTCGGGGA CAGGGCAGGG
GTCGCGGAGA TCCAGCGTCT CTGGGCGGCG AAATCCGATA GAATTGCGCG CTTAGGTGAT
GAGGTGGACG GCCAGGCGAA GGCCAACACG GCTCCGCCAG GATCGGCCAC GATCGCTGCT
GGGTCGGCGC CAATCAGAAC AGCGGCGGCG CCAGCGAAAG GCGCGCCGCT TGGGAGCGCC
GCCCGTGGGT CACAAACGAC CCCTCGCAAG CTCAGCGGCT GCCGTTCGAC GATCGATCCG
CGCGGCCTTC CGCCGGGGGC GGCTCTTGCC CGTGGTTGGT GCCTCATGGA TCTGAACCGG
CCTCTGGAAG CCGCCGAAGC TTTCGAGGTC GCTCTGCGAG CGTCCGCTTC GAAACTGCGC
GAGGATGCTG CCTACGGACA GAGCCTTGCC TATCTGCGCG CCGGACTGAC CGGGAAGGCG
GCCGTTGCGG CTGCGCGGTC GCCACAAAGT CTTGCCCGCA GAAATGAACT GCAGACCGCT
ATCCTGGCCG ACCGAGCCGT CGCCGCCTTC GATGCGGGAC GGTATAACGA GACGCTTCTG
TTTCTGCAGC AGCGTCGCCA ACTGGCAACC GAGCGGACGG ACCTGATGGT GCTGCGGGGA
TATGCCTATC TGAAACTCAA GCGCTATGCG CAAGCCAAAC GAATTTTCGA GGCGGCCGCG
GCGACTGGCA ATCGAGATGC AATACGCGGG CTCTCGGACG TTCTGGCGGA GCAGCAGGTT
TGGCCGCGCA AATTCTGA
 
Protein sequence
MKFPIVASIG VVTAALATAG LAGDGELPRG LAQVIVDSDR ADDASTAVQL SQAHGPAAPE 
STQDPPPLRV DESALRYFAS KGDTARLEAE IARLRALYPN WTPPQDPLAV PQNSDAQLET
MWRLYSEGRY AEVRRAIAER QSTDPDWQVP TDLLERLNVA EARARLVNAS DLKQYETVIQ
VGAGTPSLLT CSDNDVLWRV AEAFAETKRP NRARDAYLYI LKNCDNEPER LATVQKAASN
LSYSSMQDLL SYERTNSAGA LEFESIRDDL ARRFVAAGDE DATLEVDPKY LQRVERLAET
EGSAADALLL GWYQLRRKNT SEAERWFRLA RDKQDSAPAS QGLALVLIER KAPEEAEKVL
YPWRDASSDA RATYFAATAN LLAIDPPVAL SADVLQRIAQ ETMKSRDAAT AQQFGWYARL
LGQPATAVQW FSTTLGWKPD DEPSAYGLSI SHKELGDRAG VAEIQRLWAA KSDRIARLGD
EVDGQAKANT APPGSATIAA GSAPIRTAAA PAKGAPLGSA ARGSQTTPRK LSGCRSTIDP
RGLPPGAALA RGWCLMDLNR PLEAAEAFEV ALRASASKLR EDAAYGQSLA YLRAGLTGKA
AVAAARSPQS LARRNELQTA ILADRAVAAF DAGRYNETLL FLQQRRQLAT ERTDLMVLRG
YAYLKLKRYA QAKRIFEAAA ATGNRDAIRG LSDVLAEQQV WPRKF