Gene Smed_3884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3884 
Symbol 
ID5318564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp340409 
End bp342157 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content65% 
IMG OID640775696 
Producthypothetical protein 
Protein accessionYP_001312629 
Protein GI150376033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.329312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGTAT ACGGTGATGC AAAGCGCCTC GAGCGCGCCG ACGAATTGTC AGCCTCGATC 
GCGTCGCGGC TGCGCAGGAT CGAAGATCAG CCCCCAGGTA TCGAGCGGCA CACCGAACTC
GTTGCGATCC TGCTCAAAGC CGGAGAACTG GTGCAGGGGC TTTCGGATGC GGAGTTCAAA
CAGGCGAGAG CCGACGAGAT CTCCGCCTCC CGTGAGGCTG GCGAGCGTCT CCTCCTCGAC
CTTGCGATGC TCGTCGCCCG TTCCTGGCAA CGGGGGTTCA CGGAGCGGGT TAGCGTACCG
GCCCATGTCG CGAACCTGCT CGCGACCATC AGCGCGCCGA TCTCCCTCGG TATCCGGGAA
CCGGAAGGCT ATGCCTTTTA CGCTCTCTAC CCCGAGAGCT ATCTGGAGGC CGCGCGAGAG
TCCGGTCTGG GCTCCGATGC CGTGGTAATC GGTCTGCGGA GCATCGGAAC CACTCTTTCC
GCCATTGTCG CCGCCGCCCT GCATGCAGCG CCGCCGCTGA CGCTGCGTCC CAAGGGAGAT
CCGTTTCGAA GGCAGCTTGC AATCGCCCCG CAGCTTGCCG GACGGCTGCT GCGCAACCCG
GCAGCGGGTT TCGCTATTGT CGACGAAGGG CCCGGCCTTT CCGGCAGCTC GTTCGGATGC
GTAGCCGATT GGCTCGAGGA TCATGGCGTC GCCGCCACGC GCATTCACTT CTTCCCAAGC
CACAAGGGCG ACCTTGGACC GCAATCCTGC GGCCGCCATC GCAGACGCTG GGCGACGAGC
CCCCGCCATG TCGTGGACGT CGACGATCTC CTGATCAAGC CGGCCGGCTC TCCGCGCCAC
CTCGCCGAGT GGGTCGGCCG CCTCGTTGGA CCGCTTGAGC GGCCGCTCGA GGATATCTCC
GCGGGCGGAT GGCGAAAGGC GCTCCCCGGT GATTGCAGGC CTCCAGTCGA TATCAGGTTC
GAACGAAAGA AGTTCCTCGC GCGCACCGCC GACGGAGCGT GGCTCGTCAA GTTCGCCGGG
CTCGCCGACG TCGGACAGCG CAAACTCGTC AGGGCGCGCC TTCTCGCCGA CGCAGGCTTC
GCACCCCCGG TTGCCGGACT GTGCCACGGC TTCCTGGTTC AGAAATGGGT CGCAGCGAGA
CCTATGGCCC CTTCGGAATT GCGCCACCCG GCCTTCATAG CGCATCTCGG TCGCTATCTG
GCCTTCCGGG CACGAAGCCT GCCGCCGCCA AAGACGCAAG GGGCGTCCAT CGCCCAGCTT
TGCGAAATAG CTTCGGTCAA CACCGAAGAG GGACTGGGGT CAGCGGCCGC GTCACGCCTC
AAAAGCCGGC TCCGAAATGC GGAGCGCTTT CATGCGGCGA TCCTGCCGGT CGATACCGAC
AATCGGCTCC ATAGCTGGGA ATGGCTGGGC GAAGGAGCAC GGCAGTTCCT CAAGGCGGAC
GCACTCGATC ACAGCGGCGG CCATGATCTC GTCGGCAGTC AGGACATCGG CTGGGACATC
GCGGGGGCGC GCATCGAACT CGGCCTCACG CGAGACGAGC AGGCCGAACT GAGAGCGGCC
GTATCCGAAA ACGGCTGGCG CCCTCCCGAC GCTGAGCTGC AGGAGATTTT CGACCTCTGC
TATGCCGCTT TCCAGTTCGG CCTGTGGGCT TCCGCAAAGT CCGCTGCAGC CCCGGAGGAA
GTTCACCGGC TGGAGACAGC CGCTGCGCGT TACGGCAGCC TCCTTAGGAA CGCGACCGAG
GGCTTTTAG
 
Protein sequence
MLVYGDAKRL ERADELSASI ASRLRRIEDQ PPGIERHTEL VAILLKAGEL VQGLSDAEFK 
QARADEISAS REAGERLLLD LAMLVARSWQ RGFTERVSVP AHVANLLATI SAPISLGIRE
PEGYAFYALY PESYLEAARE SGLGSDAVVI GLRSIGTTLS AIVAAALHAA PPLTLRPKGD
PFRRQLAIAP QLAGRLLRNP AAGFAIVDEG PGLSGSSFGC VADWLEDHGV AATRIHFFPS
HKGDLGPQSC GRHRRRWATS PRHVVDVDDL LIKPAGSPRH LAEWVGRLVG PLERPLEDIS
AGGWRKALPG DCRPPVDIRF ERKKFLARTA DGAWLVKFAG LADVGQRKLV RARLLADAGF
APPVAGLCHG FLVQKWVAAR PMAPSELRHP AFIAHLGRYL AFRARSLPPP KTQGASIAQL
CEIASVNTEE GLGSAAASRL KSRLRNAERF HAAILPVDTD NRLHSWEWLG EGARQFLKAD
ALDHSGGHDL VGSQDIGWDI AGARIELGLT RDEQAELRAA VSENGWRPPD AELQEIFDLC
YAAFQFGLWA SAKSAAAPEE VHRLETAAAR YGSLLRNATE GF