Gene Smed_3176 details

Gene Information

Locus tag: Smed_3176
Symbol:
ID: 5324055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3339534
End bp: 3340703
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 63%
IMG OID: 640792124
Product: hypothetical protein
Protein accession: YP_001328835
Protein GI: 150398368
COG category:
COG ID:
TIGRFAM ID:
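
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3339534
end_bp = 3340703

# Assumption: coordinates are 1-based and inclusive, so the span
# contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1170
print(protein_length)  # 389
```

Under these assumptions the computed values match the listed gene length (1170 bp) and protein length (389 aa).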


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGCA TTTTTCTTCC GTGCCAGGAC GGCTCGCTCG AAGAATACCG GCTCCAGGGA 
ATGCCTATCG CCCGTCCCGG CGCCGTTCCG GCGTTCAGCC GTATCGCCTA TGCGGCTGCG
CATGTCGTTT CCGATCCGCT CCGCGACGCA GACCCCTGGG GCAATCCTGC GATCGACTGG
GAGGCGACGA TGGCCTTCCG GCATCATCTG TGGGGCCTCG GCTTCAGGAT TGCCGAAGCA
ATGGACACCG CGCAACGCGG GATGGGCCTT ACATGGCCGG CGGCCCGGCA ACTGATACGC
CGCTCGCTCG CCGAAGCACG CAGCGTTCCG GGCGCCGATC TTGCCTGCGG CGCCGGCACC
GACCATCTTG CGCCCGCGGA CGCGCGATCC ATCGAAGACG TCATTGCCGC CTATGAGCAG
CAAATCGGCT TCGTCGAAGC CGAGGGCGGC CGTGCGATCA TGATGGCGAG CCGGGCTCTG
GCCCGCGTGG CGCGCTCCCC CGCCGACTAC CGGCGTGTCT ACGGCCACAT CCTGTCCCAG
ACGAAAGAAA AGGTGATCCT GCACTGGCTG GGCGACATGT TCGACCCGCA GCTTCGAGGA
TATTGGGGCT CGGAAAACTT CGAGGAAGCG CTCGAAACCG TTCTGGCGAT CATCGGCGAG
AACAGCGCCA GGGTTGAGGG CATCAAGATT TCACTGCTCG ACAATGCCAA GGAACTGGCC
CTGCGCAACC GGCTGCCCGA AGGCGTGCTT TGCTTCACCG GCGACGACTT CAACTATGCG
GAACTGATCG AGGGAGACGG CACGAAATAC AGTCACGCGC TGCTCGGCAT ATTCGATGCG
GTCGCACCTT CGGCGTCGAA GGCGCTTGCG GCGCTCGCGA GCGGAGATCT CTCAACCTTC
CGCGGCGTCA TCGAACCGAC AGTACCCCTG TCGCGCAAGA TCTTCGAGGC GCCGACGCAA
TATTACAAGG CCGGCGTCGT CTTCCTCGCC TGGCTGAACG GTCATCAACG GCATTTCACC
CTGCCCGCCG GCCTTCAGTC GGCTCGCGGA TTGCTCCATT ATGCCGATAT TTTCCGCCTG
GCAGATCAGG CCAATGTGCT CGACAAGCCG GAGCTGGCTG TTGCGCGGAT GCGCAATCTG
CTTGGGGTGC TGGGAGTGGA GCAGTCGTGA
 
Protein sequence
MTSIFLPCQD GSLEEYRLQG MPIARPGAVP AFSRIAYAAA HVVSDPLRDA DPWGNPAIDW 
EATMAFRHHL WGLGFRIAEA MDTAQRGMGL TWPAARQLIR RSLAEARSVP GADLACGAGT
DHLAPADARS IEDVIAAYEQ QIGFVEAEGG RAIMMASRAL ARVARSPADY RRVYGHILSQ
TKEKVILHWL GDMFDPQLRG YWGSENFEEA LETVLAIIGE NSARVEGIKI SLLDNAKELA
LRNRLPEGVL CFTGDDFNYA ELIEGDGTKY SHALLGIFDA VAPSASKALA ALASGDLSTF
RGVIEPTVPL SRKIFEAPTQ YYKAGVVFLA WLNGHQRHFT LPAGLQSARG LLHYADIFRL
ADQANVLDKP ELAVARMRNL LGVLGVEQS
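
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built by hand rather than taken from a library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11.
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGACCAGCA TTTTTCTTCC GTGCCAGGAC GGCTCGCTCG AAGAATACCG GCTCCAGGGA"
last_line = "CTTGGGGTGC TGGGAGTGGA GCAGTCGTGA"

print(translate(first_line))  # MTSIFLPCQDGSLEEYRLQG
print(translate(last_line))   # LGVLGVEQS
```

The two outputs correspond to the first twenty and last nine residues of the protein sequence above; the final TGA codon is a stop and is not translated. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content listed in the record.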