Gene Smed_2338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2338 
Symbol 
ID5323199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2415116 
End bp2416183 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content62% 
IMG OID640791276 
Producthemin-degrading family protein 
Protein accessionYP_001328005 
Protein GI150397538 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3720] Putative heme degradation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA CTGAGAGCAT GCGCCCCACC CCTGCTGAAA TCCGTGCCTA TCGCGCTGAA 
AACCCCAAGC TTCGCGAGCG CGACATTGCT GCGCAGCTCG GCATTTCGGA GGCAGCGCTG
GTCGCTGCCG AATGCGGTCT CACCGCCATC CGAATCGAAT CCAGCGCCAA CCGCTTCCTT
GAACGCGCCG AAGAACTGGG CGAGGTCCTT GCGCTGACCC GCAATGAAAG CGCCGTCCAC
GAAAAGGTCG GACTCTACGA GAACGTAAAG CAGGGGCACG CCGCGACGCT GGTACTTGGG
TCCGAAATCG ACCTGCGCGT CTTCCCTGGC GCCTGGGAAC ACGGCTTCGC CGTTACAAAA
ACCGATGCCA AGGGGGAGGT TCGCCGCAGC CTGCAGTTTT TCGACAAATG GGGCAACGCG
GTGCACAAGG TCCACTTGCG CCCTGCATCG CATCTCGCGG CCTACGAGAA GCTTGTTGAA
GACCTTCGCC TTGACGACCA ATCGCAAGAC TTCATTGCCG ATCCAGGCGC GCCTGCAAAC
GACGACGTGA CCGATGACTC GGTCGATACG GCAGAGCTGC GCGATCGCTG GTCGAAGCTC
ACCGACACGC ATCAGTTCCC GGGCATGTTG AGAAAGCTCA AGGTCGGTCG GCGCCGGGCG
CTGCATTCGA TCGGCGACGA CTTCGCCTGG CGCCTCGACA CCGCCAGCGT CGAAACGATG
ATGCGCAGTG CCGCAGAAAC GGCGCTGCCG ATCATGTGCT TCGTAGGCAA TCGCGGGGTC
ATCCAGATCC ACTCCGGTCC GGTCGTGAAG ATCGGGACGA TGGGGCCGTG GCTGAACGTC
ATGGACGAAA CTTTCCATCT GCATCTGCGC ACCGACCACA TCACCGAACT GTGGGCCGTG
CGCAAGCCGA CGGCGGACGG ACATGTGACA TCCGTCGAGG GGCTCGACGC CAAGGGCGAG
ATGATCATTC AGTTCTTCGG AAAGCGGAAG GAAGGGTCCT CGGAAAGGGC CGAATGGCGC
AGCCTGGCCG AGGGACTGCC GCGTCTGAAG ACCGTCGTCG CGGCCTGA
 
Protein sequence
MTMTESMRPT PAEIRAYRAE NPKLRERDIA AQLGISEAAL VAAECGLTAI RIESSANRFL 
ERAEELGEVL ALTRNESAVH EKVGLYENVK QGHAATLVLG SEIDLRVFPG AWEHGFAVTK
TDAKGEVRRS LQFFDKWGNA VHKVHLRPAS HLAAYEKLVE DLRLDDQSQD FIADPGAPAN
DDVTDDSVDT AELRDRWSKL TDTHQFPGML RKLKVGRRRA LHSIGDDFAW RLDTASVETM
MRSAAETALP IMCFVGNRGV IQIHSGPVVK IGTMGPWLNV MDETFHLHLR TDHITELWAV
RKPTADGHVT SVEGLDAKGE MIIQFFGKRK EGSSERAEWR SLAEGLPRLK TVVAA