Gene Smed_5354 details


Gene Information

Locus tag: Smed_5354
Symbol:
ID: 5319656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 317774
End bp: 319972
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 11
GC content: 63%
IMG OID: 640777127
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001314059
Protein GI: 150377464
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
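
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 317774
END_BP = 319972
GENE_LENGTH_BP = 2199
PROTEIN_LENGTH_AA = 732


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks agree with the record: the span from 317774 to 319972 covers 2199 bp, which is 733 codons, or 732 residues plus one stop codon.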


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.246739
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.260474
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTCG ACACACCAGC CACGACGAAC CCGATCGACC AGGGCAAGGT CGTCGGCAAG 
CCCATCCACC GCATCGACGG TCCGCTCAAG ACCACCGGCA AAGCCGTCTA TGCCTATGAG
TGGCACGATC GGAATACCCG CTATGCCTAT GGCTATGTGG TCGGCTCGGC AATCGCCAAG
GGCCGGATCA GGTCCATGGA TACCGCCGCC GCGAAGAAGG CCCCGGGCGT GATCGCGGTC
GTAACGACAG AAGCGGTGGG GGAACTGAAG AAAGGAAAAT ACAACACGGC CAAGCTCTTC
GGCGGCATGG AGATTCAGCA CTATCACCAG GCGATCGCCG TCGTGGTCGC GGAAACATTC
GAGGAGGCTC GCGCGGCCGC CGCGCTGGTC AAGGTCGATT ATGCCGAGGA AAAGGGCGCA
TTCGACCTGG CGGCGGCAAA GGACCGCGCC GTCGAGCCGG AAGGAGGGGG TGCCTCCGGA
GCGGGCGACT TCGATGCCGG CTTCGAGGCA GCGCCCGTTA AGCTCGACCA GATCTATACG
ACCCCGGATC AGTCGCATGC GATGATGGAG CCGCACGCCT CCATCGCCGC CTGGAATGGC
GATGACCTGA CTGTCTGGAC GTCGAGCCAG ATGATCGACT GGTGGCGGAC CGATCTCGCG
ACGACGCTCG GAATCGACAA GGAGAAAGTC CATATCATGT CGCCCTTCGT GGGCGGTGGT
TTCGGAGTCA AGCTGTTCCT GCGAGCAGAT GCGGTGCTTG CCGCGCTCTC CGCCCGCGAA
GCCGGGCGCC CCGTGAAGGT CGCCCTGCCC CGCCCCTTCC TGATGAACAA CACGACGCAC
CGGCCGGCGA CGATCCAGAG AATCCGCATT GGCGCAGGTC GCGATGGCAA AATCACTGCG
ATCGGTCATG AAAGCTGGTC CGGGGACCTC CCGGGCGGAG GACCCGAAGT GGCCGTTCAG
CAGACGCGGC TGCTTTACGC CGGCGAGAAC CGGATGACGG CCATGCGGCT TGCAACACTC
GATCTTCCCG AAGGAAATGC GATGCGCGCA CCGGGCGAAG CCCCGGGCAT GATGGCGCTG
GAGATCGCGA TCGACGAGAT GGCCGAAAAG CTCGGTCTCG ATCCCGTCGA ATTCCGTATC
ATCAACGATA CTCAGGTCGA TCCGGAGAAT CCGGAAAGAC CTTTTTCGCA CCGCAATCTC
ATCGGTTGCC TGCGCACCGG AGCGGAGCGC TTCGGCTGGC GGGAACGGAG CAAGAAAGCC
GGGGCTCGTC GGGAAGGAGA CTGGCTTGTC GGTATGGGAG TGGCCGCCGC GTTCCGCAAC
AATCTTGTGC TGCCCTCGGC AGCCCGCATC CGGCTCGACC GCGAGGGCAT CGTCACGGTC
GAAACCGACA TGACCGATAT CGGCACGGGC AGCTACACGA TTATCGCCCA GACCGCTGCA
GAAATGCTAG GCGTTCCGAT CGAAAAGGTC GCCGTCAGCC TGGGAGACTC GCGCTTCCCG
GTCTCCTCGG GTTCGGGCGG GCAGTTCGGG GGGAACTGCT CCACGGCAGG CGTATACGCC
GCTTGCGTCA AGCTACGCGA AGCAGTGGCG CAAAAGCTAA GCTTCAACAG CGCCGAAGAT
CTCATCTTCG CAGAGAGTGA AGTCCGATCA GGAGATCGCC GCATGCCGCT GGCGCAGGCC
GTAGCCGATG AGGCGCTCGT GGCGGAGGAC CGGATCGAGT TCGGCGATCT CACCAAAACC
CATCAGCAGT CCACCTTCGG CGCGCATTTT GTCGAGGTCG GCGTGGACGT AGCGACAGGA
GAGACCCGGA TCCGGCGGAT GCTGGCAGTC TGCGCCGCCG GCCGGATCCT CAATCCAATT
ACTGCGCGCA GTCAGGTGAT CGGCGCGATG ACGATGGGCG TTGGCGGTGC CTTGTCGGAG
GAGCTGGTCG TCGATAAGGA GCGCGGCTTC TTCGTCAACC ACGACCTTGC CGCTTACGAA
GTGCCAGTCC ACGCCGATAT ACCGCACCAG GACGTTGTCT TCCTCGACGA GACCGATCCG
ATGTCCTCGC CGATGAAGGC GAAGGGCGTC GCGGAGCTCG GCATCTGCGG TGTCGCCGCA
GCTGTAGCGA ACGCCATCTA TAACGCGACC GCCATAAGGG TGAGAAATTA TCCAATCACG
CTCGACAAGC TGATCAAAGA GCTTCCGGAA ATCAGTTAG
 
Protein sequence
MRFDTPATTN PIDQGKVVGK PIHRIDGPLK TTGKAVYAYE WHDRNTRYAY GYVVGSAIAK 
GRIRSMDTAA AKKAPGVIAV VTTEAVGELK KGKYNTAKLF GGMEIQHYHQ AIAVVVAETF
EEARAAAALV KVDYAEEKGA FDLAAAKDRA VEPEGGGASG AGDFDAGFEA APVKLDQIYT
TPDQSHAMME PHASIAAWNG DDLTVWTSSQ MIDWWRTDLA TTLGIDKEKV HIMSPFVGGG
FGVKLFLRAD AVLAALSARE AGRPVKVALP RPFLMNNTTH RPATIQRIRI GAGRDGKITA
IGHESWSGDL PGGGPEVAVQ QTRLLYAGEN RMTAMRLATL DLPEGNAMRA PGEAPGMMAL
EIAIDEMAEK LGLDPVEFRI INDTQVDPEN PERPFSHRNL IGCLRTGAER FGWRERSKKA
GARREGDWLV GMGVAAAFRN NLVLPSAARI RLDREGIVTV ETDMTDIGTG SYTIIAQTAA
EMLGVPIEKV AVSLGDSRFP VSSGSGGQFG GNCSTAGVYA ACVKLREAVA QKLSFNSAED
LIFAESEVRS GDRRMPLAQA VADEALVAED RIEFGDLTKT HQQSTFGAHF VEVGVDVATG
ETRIRRMLAV CAAGRILNPI TARSQVIGAM TMGVGGALSE ELVVDKERGF FVNHDLAAYE
VPVHADIPHQ DVVFLDETDP MSSPMKAKGV AELGICGVAA AVANAIYNAT AIRVRNYPIT
LDKLIKELPE IS
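
The gene and protein sequences above are printed in blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the blocked gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon table is the standard set of amino-acid assignments, which translation table 11 shares for sense and stop codons (table 11 differs in which codons may act as starts). The short fragments in the usage lines are the first and last codons of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    head = clean_sequence("ATGAGATTCG ACACACCAGC C")
    tail = clean_sequence("ATCAGTTAG")
    print(translate(head))  # MRFDTPA
    print(translate(tail))  # IS
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence once that has been passed through `clean_sequence` as well, and `gc_fraction` can be compared against the reported GC content.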