Gene Smed_3890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3890 
Symbol 
ID5318684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp347980 
End bp349110 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content62% 
IMG OID640775702 
Producthypothetical protein 
Protein accessionYP_001312635 
Protein GI150376039 
COG category[S] Function unknown 
COG ID[COG4641] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0848246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTTC TTTTCTACAC CCACTCATTG ATTTCCGATT GGAACCACGG CAATGCGCAT 
TTTCTGCGCG GTGTCATGCG TGAAATCACA CGCCGCGGCC ATCTGGCCGT GGCGCTTGAG
CCGGGCGATT CCTGGAGCCG CCGCAATCTG ATTGCCGATC AGGGCATCGG ACCGATCGCC
GCCTTTCGCA AACACTTTCC CGATTTGCAG GTCGCGATCT ACGAGTCCGA TTTCGACCAT
GAAGCGGCAG TCGCCGAGGC TGACATCGTC ATTGTCCATG AGTGGACCGA TCCCGCTCTG
ATCGCCGAAC TCGGCCGCAT CCGCCTGAAG GGTGGACGCT TCACGCTGGC GTTCCACGAC
ACACATCACC GCGCCGTCAG CGCCAAACGG GACATCGCCA GGCTGGACCT TTCCGGTTAC
GACTTCGTTC TGGCTTTCGG TGAGGCGCTG CGCGAGCGCT ATCTGCAGGC GGGATGGGGA
AGGCACGTCC ATACCTGGCA TGAGGCTGCC GACACTTCGC TGTTCCATCC GATGCCGGAG
GTGGAAAAGC GTGGCGAGCT CATCTGGATC GGCAATTGGG GCGATGACGA ACGCAGCAGC
GAAATCATGT CCTTCCTCGT CGAACCGGCA AAAAAGCTGA AACTGAGGGC GACGGTCCGA
GGCGTAAGAT ATCCCGACAC GGCGCTCAGG GCCTTGCGCG CCGCCGAAAT CGACTATGGC
GGCTGGCTCG CGAACGCAGC CGTCCCGCGG GCCTTCGCGG AGCACCGGGT CACCATGCAC
ATTCCGCGCC GACCCTACGT GGAGGCACTC CCGGGCATAC CCACGATCCG CGTTTTCGAG
GCTCTTTCCT GCGGGATTCC GCTGGTCTCG GCACCATGGA CGGATGCCGA AGGGCTATTC
CGGCCCGGCA AGGATTTCTG CATCGCCAGG GACGGCAAAG AGATGGCGCG ACTTCTGCGT
CAACTTCTCG CGGAACCAGC CTTCGCAACG GAGATGGCCG CTTCCGGACT GGAGACCGTC
CGAGCACGCC ACACATGCGG CCATCGCGTC GACGAGCTTC TCTCCATTCT GGCCGCCTAC
ATGCCGCACA GCAACGTGGA ACGCACAGTC ACCGAGGAGG TCCAGCTATG A
 
Protein sequence
MRFLFYTHSL ISDWNHGNAH FLRGVMREIT RRGHLAVALE PGDSWSRRNL IADQGIGPIA 
AFRKHFPDLQ VAIYESDFDH EAAVAEADIV IVHEWTDPAL IAELGRIRLK GGRFTLAFHD
THHRAVSAKR DIARLDLSGY DFVLAFGEAL RERYLQAGWG RHVHTWHEAA DTSLFHPMPE
VEKRGELIWI GNWGDDERSS EIMSFLVEPA KKLKLRATVR GVRYPDTALR ALRAAEIDYG
GWLANAAVPR AFAEHRVTMH IPRRPYVEAL PGIPTIRVFE ALSCGIPLVS APWTDAEGLF
RPGKDFCIAR DGKEMARLLR QLLAEPAFAT EMAASGLETV RARHTCGHRV DELLSILAAY
MPHSNVERTV TEEVQL