Gene Smed_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3803 
Symbol 
ID5318101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp253480 
End bp254823 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content64% 
IMG OID640775616 
Producthypothetical protein 
Protein accessionYP_001312549 
Protein GI150375953 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.853464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCG CGGCCCCCCG TTTGTACCGT CTTCTTGCAG TCCTGTTCTG CTGGCTTTCG 
ACCCAAGGCC CGGCACTCTC CGCCGATCCG CTCGCCCGGG CAGCGCTGCA GTCGCAGGGA
ACGCTCTATG CCGGCCAGCA AATCCTTGTA GACGTCGACG TCCTCGTCCC GAACTACTTC
CTGCAATCGC CTCAATTCCC CGCAATCGAT ATTCCAGGCG CGATAGTGAC GCTTGACGAC
GGAAGGGCAC TGAACCTCAA CGAAACGATC GACGGAACGC CCTATTCCGG CATTCGGCGC
ACCTACATAA TAGTACCGCA GGCGGCCGGC GATTTTACCT TGCCCCCCGT CCCGATCACC
TTCGGCTATG CAGCGGTCCC CCCTCAGGCC ACCCGGGGCG AGGTCACGTT TCCGCCGCTT
CGCTTCACAG TTCGAAGCGC GCCGGGCACC GCCGGCGATA GTCCCGGTAT CGTTGCGGCA
AAAGTCAGCG TCAGCCAGGA ACTGGATCAG GATCCGGCGC AGTTGAAGGC TGGCGATACG
ATGGTTCGCA CGGTCACGGT GCGGGCCGAA GGGCTTCGCG CCATGATGAT CCCCGAACCG
GACTTTACGG CTCCTCAGGG TGTCCGCCTC TACCGGCAGG ATCCTTCACT TTCGGAAGAG
ACGGACCGCA ACGGCCAGTC GATCGCGGGC CTCCGCAAGG ACGTTGCAAG CTATCTCTTC
CAAGACGCCG GCAATTATGT TCTGCCTGCC GTGACCGTTA GCTGGTTCGA CCCGGCTTCG
GCGAAGACCC AGTCGGCCAC GGCTCCCGCG GTCAGCGTGA CGGTTTCAGC GGCGGCCGCC
CTCTCCCCTG CTCTTGCGCC CCCCGCGCCG GAGCCGCAAC GCGATGCCTT CGACTGGCTG
CACCTGGCTC TGATCGGGGG GATCGTGCTT CTCACCGCCT CCTCGCTTTG GGTTGCCGCC
AACGGATTGT CCCGGCTGGA AGCCTGGTGG CAAGAGCGCC GCTCGAGAGA GCGGCAGTCG
GAGCCGGCTT TTTTCCGGCA AGTGGAGCAG GCGTGCAAAA GCGGCCGAGA CGACGCGATC
GCGCGCGCGC TCGACGCTTG GTCGCGCAAA GCGGGAGCGA TGCCGCTCGA ACTTTGGCTT
GGGCGCTTCG CGGATGCCGA GACCAAGCAG GTTTACCAGG CCTGGCAGAG AACGCGTTAC
GCTTCTCAGC AAGCATCCCA GCCGTTAGGA GCAGGCTTGC TTCTGCCCGG CCTGAAGAAG
GCTCGGGAAG CCTGGTTGAT GCAGGAAACG GAAGCGCCCG GCCGCAAGCC GGCCTTGCTT
CCCCTCAACC CGACTATGCC GTGA
 
Protein sequence
MTIAAPRLYR LLAVLFCWLS TQGPALSADP LARAALQSQG TLYAGQQILV DVDVLVPNYF 
LQSPQFPAID IPGAIVTLDD GRALNLNETI DGTPYSGIRR TYIIVPQAAG DFTLPPVPIT
FGYAAVPPQA TRGEVTFPPL RFTVRSAPGT AGDSPGIVAA KVSVSQELDQ DPAQLKAGDT
MVRTVTVRAE GLRAMMIPEP DFTAPQGVRL YRQDPSLSEE TDRNGQSIAG LRKDVASYLF
QDAGNYVLPA VTVSWFDPAS AKTQSATAPA VSVTVSAAAA LSPALAPPAP EPQRDAFDWL
HLALIGGIVL LTASSLWVAA NGLSRLEAWW QERRSRERQS EPAFFRQVEQ ACKSGRDDAI
ARALDAWSRK AGAMPLELWL GRFADAETKQ VYQAWQRTRY ASQQASQPLG AGLLLPGLKK
AREAWLMQET EAPGRKPALL PLNPTMP