Gene Smed_2969 details

Gene Information

Locus tag: Smed_2969
Symbol:
ID: 5323846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3115874
End bp: 3117529
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 65%
IMG OID: 640791920
Product: HemY domain-containing protein
Protein accession: YP_001328633
Protein GI: 150398166
COG category: [S] Function unknown
COG ID: [COG3898] Uncharacterized membrane-bound protein
TIGRFAM ID:
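
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. The function names here are illustrative only and do not belong to any database API. The sketch assumes 1-based inclusive coordinates and a CDS length that includes the stop codon; both assumptions agree with the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(3115874, 3117529)
print(length_bp)                  # 1656
print(protein_length(length_bp))  # 551
```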


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGAA TACTGTTCTT CGTCCTGCTC GTCCTTTGCC TTGCTCTCGG CTTTGCCTGG 
CTCGCGGATC GCCCGGGGGA ACTGTCGCTA ATCTGGCAGG GTCAGCTGAT CGAGATGAGC
CTGCTGCGCG CCGCCTCGCT CCTGATTTCG GTTTTCGCCG CCGTTCTGAT CGCCGTCTGG
CTGCTCCGCA CGATCTGGTC GTCTCCCCAT ACGGTCACGC GCTATTTTCG CGCCCGCAAG
CGCGACCGTG GCTATCAGGC GCTGTCGACC GGCCTGATCG CAGCCGGCGC CGGTGATGCC
AATCTCGCAC GCAAGATGAC CGCGCGCACG CGTAGCCTGA TCAGCTCCGA CCAGGAGCCG
CTGATCCATC TCCTGGAAGC GCAGACCTCG CTGATCGAAG GCAAATACGA CGACGCGCGC
AAGAAATTCG AGTTGATGGC GGATGATCCT GAGACGCGCG AGCTCGGCCT GCGCGGTCTT
TACCTCGAAG CCAAGCGGCT CGGCGCAAAT GAAGCCGCGC GCCACTATGC CGAGCGTGCC
GCCGAAAAGG CGCCGCACTT GCCGTGGGCG ACGCTCGCAA CGCTCGACCA TCGCAGCCAG
GCGCGTCAGT GGGACGAGGC TATCCGGCTG CTCGATCAGA GCCGCGCCGC CAATGTGCTG
GAAAGGAAGG AGGCGGATCG CAAGAAGGCC GTTCTCCTGA CAGCGCGGGC GATGGAACAA
CTGGAGGCCG ACCCGAAATC CGCACGTGAC GACGCAAGGG CGGCGCTGAA GCTCGACGAC
AGTCTCGTGC CGGCGGCGCT GGTTGCCGCC AAGGCGCTGT TTCGCGAGGA CAATCTGCGC
AAAGGCGCCT CGATCCTCGA AAAAATGTGG AAGTTGGATC CCCATCCGGA GATCGCGCGT
CTTTACGTGC GGGCGCGCGG CGGCGATTCC GCCCTCGATC GCCTGAAGCG CGCAAAAAGA
CTTGAAACGC TTCGCGGTAA CAATGCTGTT TCATTGGCCA CTGTCGCCGA GGCGGCGCTC
GAAGCGCGCG AACTCGCGCT TGCGCGGACG AAGGCAGAGG CCGCCGCCCG GATCGACCCA
AGGGAAAGCA TCTTTCTGCT GCTCGCGGAT ATAGAAGAAG CCGACACCGG GGACGAGGGC
CGCATCCGTT ACTGGATGTC GCAGGCGCTC AGGAGCCCGC GCGATCCAGC CTGGACCGCT
GACGGTGTCA CATCGCCGAG CTGGCTGCCG GTCTCGCCGG TCACCGGTCG CCTCGACGCG
TTCGAGTGGA AAGCTCCACC GGCGCGGCTA CCGGCCGCAA CCGAAGAAGG GTATCTGAAC
CCGGACGCGG CAATTCGCAG CCTCCCGCCC GTCGCAACCG TGCCTCGCCC GGCCACTGAA
CCCGAGACTG AAAGCGCCGT CGATGCCGCC CCGCCGGCAG AGCGCGTTGA GGCCGAGCCC
GAGAAAGCCA TCACCGTTCC CGACCCGAAC GAAACGGTCG CCCCTCCAGC CGGGGATCCG
AACCGGAAAG AACCGGTCCC GGCGGTCGCC TCGCCCGCCA AAGGCAAGGA TTCCGAGGAG
GCGCCGGATC CCTTTTTCGG CCGACCTCCG GACGATCCGG GCGTACGCGA GCCGGCGTCG
ACGGAACAGA ACACCGGCAA CTTCCGTCTT TTCTGA
 
Protein sequence
MIRILFFVLL VLCLALGFAW LADRPGELSL IWQGQLIEMS LLRAASLLIS VFAAVLIAVW 
LLRTIWSSPH TVTRYFRARK RDRGYQALST GLIAAGAGDA NLARKMTART RSLISSDQEP
LIHLLEAQTS LIEGKYDDAR KKFELMADDP ETRELGLRGL YLEAKRLGAN EAARHYAERA
AEKAPHLPWA TLATLDHRSQ ARQWDEAIRL LDQSRAANVL ERKEADRKKA VLLTARAMEQ
LEADPKSARD DARAALKLDD SLVPAALVAA KALFREDNLR KGASILEKMW KLDPHPEIAR
LYVRARGGDS ALDRLKRAKR LETLRGNNAV SLATVAEAAL EARELALART KAEAAARIDP
RESIFLLLAD IEEADTGDEG RIRYWMSQAL RSPRDPAWTA DGVTSPSWLP VSPVTGRLDA
FEWKAPPARL PAATEEGYLN PDAAIRSLPP VATVPRPATE PETESAVDAA PPAERVEAEP
EKAITVPDPN ETVAPPAGDP NRKEPVPAVA SPAKGKDSEE APDPFFGRPP DDPGVREPAS
TEQNTGNFRL F
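
Both sequences are printed in space-separated blocks of ten characters, so they have to be joined before use. The following is a minimal sketch of joining the blocks and translating the result. The helper names are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids as the standard code. The example uses only the first and last lines of the gene sequence.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments assumed identical
# to the standard code for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


first_line = join_blocks(
    "ATGATTCGAA TACTGTTCTT CGTCCTGCTC GTCCTTTGCC TTGCTCTCGG CTTTGCCTGG"
)
last_line = join_blocks("ACGGAACAGA ACACCGGCAA CTTCCGTCTT TTCTGA")

print(translate(first_line))  # MIRILFFVLLVLCLALGFAW
print(translate(last_line))   # TEQNTGNFRLF*
```

The two translations match the start and end of the protein sequence listed above. Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the record.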