Gene Smed_3777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3777 
Symbol 
ID5318225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp224842 
End bp227052 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content64% 
IMG OID640775590 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001312523 
Protein GI150375927 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.541011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACCGA AACTAATGCA ATCGATGGCC CTTCCCGCTC GCGTCGAAGC CTCGCGCCGC 
CAATTCCTGC TCGGTGCGCT TGCCGCCGGC ACCGGCATCG CCGTTGGGTA CCGGCTCCTA
TCCGCTTCGC CGGCCCTTGC GGGCGAAGCG CAAGCGGGGA ATGATTCACA TGCGTTTTCC
CCATATCTGA CGATCGGCGG GGATGGGAAG GTCACCATTC TCTCGTCGCA GTTCGAGATG
GGCCAGGGCT CATATAACGG CATTGCCACG CTGGTGGCGG AGGAGTTGGA CGCCGACTGG
TCCACGATCG ACGTCAAGGG AGCCGCCGGC AACATCCCGG CCTACGGCAA TATCGCCTTT
GGAGGTACCA TGCAGGGTAC CGGGGGCTCG ACATCCATGT CGACCTCATG GGAGCGCTAC
CGCAAGGCTG GTGCCGCCGC CCGAGCCATG CTCATTGCCG CTGCTGCAGC GGAATGGAAG
GTGGACGCCG CGGAGATCAC CGTCGAGAAC GGCGTTCTTT CCCATCCGTC GGGCAAGAGC
GGTGGTTTCG GCGCGTACGC CGCCAAGGCG ACGACGATGC CGGTGCCGGC CGACGTGAAG
CTCAAGGAAC CGAGCGCCTG GAAATTTATC GGCAATGCCG AACTGAAGCG CTTCGACAGT
GCCCGCAAGG CAAACGGGAC CGAGCAATAC ACCATCGATG TCAAGCTGCC GGGAATGCTG
ACGGCAGTGA TGATTCACCC GCCCCTCTTC GGCGCCAAAG CCAAGTCCTT CGATGCTTCG
GCCGCCCGCG CCATCAAGGG CGTGGTGGAC GTGGTGGAAA CGCCACGCGG CATTGCGGTG
GTCGGCGAGC ATATGTGGGC GGCAATCAAG GGCCGTGAAG CGGTAACCGT CGAATGGGAC
GAGTCCGGCG CCGAAAAGCG GGGGACGGCG GAGCTGATGT CCACCTACCG CGACCTCGCG
GGCAAGACGC CGGCGGCCTT CGCGCGCAAG GACGGCGACG CCGAGGCTGC CTTCGCAGCA
GCCGCCAAGG TCATCGAGGC GACCTTCGAG TTCCCCTATC TGGCCCATGC CGCTCTTGAG
CCCCTGAATG CGGTTGCACG CAGGAACGAG GACGGCACGA TCGAGATCTG GGGCGGACAC
CAGCTCCCCG ACGTGTACCA GAAGCTCGCC AGCGAGATTG CCGGAGTGCC GGTCGAAAAT
GTCCGACTGA ACGTCATGAA GACCGGCGGC AGCTTCGGGC GGCGTGCCGT GTTCGACGGC
GACGTCGTGG TGGAAGCCGT ACATGTGGCC AAGGCCCTCG GCTTCCGTGC ACCGGTCAAG
GTGCAATGGA CGCGGGAGGA AGACACGCGC GCCGGCCGCT ATCGGCCCGC CTATGTACAT
CGCCTGAAAG CCGGGATCGA TGCAGACGGC AAACTCGTCG CCTGGAGCGA TCACATCGTC
GGCCAGTCAA TCGTGGCGAA AACGGCTTGG GATGGCATGG TTCAGAACGG CGTCGACCCG
ACGTCTGTGG AGGGGGCGAA CAATCTGCCA TATGCCATTC AGAACCAGAC GGTCGGGCTC
ACGACCACCG ATGTTCGCGT TCCGGTTCTC TGGTGGCGCT CCGTCGGCTC GACACACACC
GCCTTCGCGG CCGAAGCCTT TCTGGACGAA GTTGCCCAGG CTGCGGGACG TGACCCACTG
GAGTTTCGCC TTTCGATGCT GGAGCCGCAG TCGCGCCACG CAACCGTTCT CAAGCTGGCG
GCGGAAAAGG CGGAATGGCA GAAACCGCTG CCTGAAGGGC GCTTCCGCGG CGTTGCGGTC
GCCGAGAGCT TCGGTTCGGT CGTGGCACAG ATCGCCGAGG TTTCCACAGA TGGGAACGGC
ATTAAGGTCG AGCGCGTTGT CGCTGCGGTC GATTGCGGTC TCGCGATCAA TCCGGATCAG
GTGCGCGCTC AGGTCGAAGG CGGAATAGGC TTTGGCCTCA GCGCCATCCT GGGCGAGGAG
ATCACGCTGA CGGATGGCAA GGTCGACCAG GGCAATTTCG ACATGTACAC GCCGCTCAGG
ATCGATGCGA TGCCGAAGGT CGAGGTTCAT ATCGTCGCCT CGTCCAATCC TCCTTCGGGG
ATCGGCGAGC CCGGCGTTCC GCCGATCGGC CCCGCAGTTG CCAATGCCGC CTTCAAAGCT
CTCGGCAAAC GGATACGTGT CATGCCGTTC GCGAAGTCGC TTAACGCCTG A
 
Protein sequence
MIPKLMQSMA LPARVEASRR QFLLGALAAG TGIAVGYRLL SASPALAGEA QAGNDSHAFS 
PYLTIGGDGK VTILSSQFEM GQGSYNGIAT LVAEELDADW STIDVKGAAG NIPAYGNIAF
GGTMQGTGGS TSMSTSWERY RKAGAAARAM LIAAAAAEWK VDAAEITVEN GVLSHPSGKS
GGFGAYAAKA TTMPVPADVK LKEPSAWKFI GNAELKRFDS ARKANGTEQY TIDVKLPGML
TAVMIHPPLF GAKAKSFDAS AARAIKGVVD VVETPRGIAV VGEHMWAAIK GREAVTVEWD
ESGAEKRGTA ELMSTYRDLA GKTPAAFARK DGDAEAAFAA AAKVIEATFE FPYLAHAALE
PLNAVARRNE DGTIEIWGGH QLPDVYQKLA SEIAGVPVEN VRLNVMKTGG SFGRRAVFDG
DVVVEAVHVA KALGFRAPVK VQWTREEDTR AGRYRPAYVH RLKAGIDADG KLVAWSDHIV
GQSIVAKTAW DGMVQNGVDP TSVEGANNLP YAIQNQTVGL TTTDVRVPVL WWRSVGSTHT
AFAAEAFLDE VAQAAGRDPL EFRLSMLEPQ SRHATVLKLA AEKAEWQKPL PEGRFRGVAV
AESFGSVVAQ IAEVSTDGNG IKVERVVAAV DCGLAINPDQ VRAQVEGGIG FGLSAILGEE
ITLTDGKVDQ GNFDMYTPLR IDAMPKVEVH IVASSNPPSG IGEPGVPPIG PAVANAAFKA
LGKRIRVMPF AKSLNA