Gene Smed_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1786 
Symbol 
ID5322644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1868368 
End bp1870419 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content60% 
IMG OID640790724 
Productferredoxin 
Protein accessionYP_001327456 
Protein GI150396989 
COG category[R] General function prediction only 
COG ID[COG3894] Uncharacterized metal-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00262051 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGAACG TGCCTTCGAA GGACGAAAAG AACGAACCGC TGGTGCTCTT CATGCCATCG 
GGCAAACGCG GCCGCTTCCC GGTTGGCACG CCGATCCTCG ATGCCGCTCG CTCGCTCGGG
GTCTATGTCG AAAGCGTCTG CGGCGGTCGC GCCACCTGCG GGCGGTGCCA GGTGTCCGTC
CAGGAAGGCA ATTTCGCCAA GCACAAAATC GTCTCTTCCA ACGACCATAT CTCGCCATTC
GGGCCCAAGG AGCAGCGTTA CGCCAGCGTA CGTCAATTGC CCGACGGCCG CCGCCTATCG
TGCTCGGCCC AGATCCTCGG CGATCTCGTC ATAGACGTGC CGCAGGACAC AGTCATCAAC
GCTCAGGTGG TGCGCAAGGC CGCGACCGAC CGCGTTATCG AGCGCAATGC TGCAGTACAA
CTGTGTTATG TCGAAATCGA CGAGCCGGAT ATGCACAAGC CGCTCGGCGA TTTCGATCGG
ATGAAGGCCG TATTGGAGAA AGACTGGGGC TGGAAGGATC TCCTGATCGC TCCACACCTC
ATCCCACAGG TGCAAGGCAT ATTGCGCAAG GGAAATTGGA CGGTGACCGC AGCAATCCAC
CGCGACATGG ATTCCTCCCG TCCCTTTATC GTCGGGCTAT GGCCGGGGCT GAAGAACGAG
GCATATGGCG TCGCCTGCGA CATCGGCTCG ACGACGATTG CGATGCATCT CGTATCGCTG
CTGTCCGGAC GTATAGCCGC CTCCTCGGGA ACCTCGAATC CGCAGATCCG CTTTGGTGAG
GATCTGATGA GCCGCGTTTC TTACGTGATG ATGAACCCGG ATGGCCGGGA GGCAATGACC
AAGGCCGTGC GCGACGCCGT GAACGACCTC ATCGGCAAGG TTTGCGCCGA AGGCGAGGTC
GATCGCCACG ACATCCTCGA TCTGGTCTTC GTCGGCAATC CGATCATGCA TCATCTGTTT
CTCGGGATTG ATCCGACAGA ACTCGGACAG GCACCATTTG CCCTCGCCGT CTCCGGTGCC
CTACAATATT GGGCGCATGA GATCGACATC GAGGTCAACC GCGGCGCGCG CATCTATATG
CTTCCCTGTA TCGCCGGCCA TGTCGGAGCG GATGCCGCAG GTGCGACACT TTCCGAAGGG
CCGCACCGCC AGGACAACAT GATGCTGCTG GTCGACGTAG GGACCAATGC CGAAATCGTA
CTCGGCAACA AGGAGCGCGT CGTCGCGGCC TCCTCGCCGA CCGGCCCGGC GTTTGAAGGG
GCCGAGATTT CTTCCGGACA ACGTGCAGCA CCAGGGGCGA TCGAGCGCGT GCGCATCGAT
CCCGAGACTT TGGAGCCGCG GTTCCGGGTG ATCGGTGTCG ATAAATGGTC GGACGAAGAA
GGTTTCGCCG AAGCCGCCGC GGCAGTCGGT GTAACTGGAA TCTGCGGCTC GGCGATTATC
GAGGTCGTGG CGGAGATGTA CCTCACGGGC ATCATTTCGC AGGACGGCGT CGTCGACGGC
GCAATGGCGG CGAAAAGCCC CCGCATCATC CCGAACGGCC GCACCTTTTC CTACCTACTG
CACGATGGCG CACAACGAAT CACCGTGACG CAGAACGACA TCAGGGCGAT CCAGCTCGCC
AAGTCGGCGC TCTATGCCGG AATTAAGCTG CTCATGGAGA AACAGGGCGT CGATCACGTC
GACACGATCC GGTTTGCCGG CGCCTTCGGC TCCTTCATCG ATCCAAAATA TGCCATGGTG
CTGGGCCTGA TACCCGATTG CGACCTCACG GAAGTGAAGG CGGTTGGCAA TGCCGCCGGC
ACCGGCGCGC TGATGGCGCT CCTCAATCGC GGACACCGTC GCGAAATCGA GCAAACCGTG
AGGAAAATCG AGAAGATAGA GACGGCGCTT GAATCAAAAT TTCAGGAGCA TTTCGTCAAC
GCAATGGCGA TGCCGAACAA GGTGGATGCC TTCCCGAAAC TCGCCGAAGT GGTTACCTTG
CCGGCACGCA AGGTGCTGAC CGATGACGGT GGCGACGGAA GTGGACGCAG ACGGCGACGC
AACAGGGAAT AG
 
Protein sequence
MLNVPSKDEK NEPLVLFMPS GKRGRFPVGT PILDAARSLG VYVESVCGGR ATCGRCQVSV 
QEGNFAKHKI VSSNDHISPF GPKEQRYASV RQLPDGRRLS CSAQILGDLV IDVPQDTVIN
AQVVRKAATD RVIERNAAVQ LCYVEIDEPD MHKPLGDFDR MKAVLEKDWG WKDLLIAPHL
IPQVQGILRK GNWTVTAAIH RDMDSSRPFI VGLWPGLKNE AYGVACDIGS TTIAMHLVSL
LSGRIAASSG TSNPQIRFGE DLMSRVSYVM MNPDGREAMT KAVRDAVNDL IGKVCAEGEV
DRHDILDLVF VGNPIMHHLF LGIDPTELGQ APFALAVSGA LQYWAHEIDI EVNRGARIYM
LPCIAGHVGA DAAGATLSEG PHRQDNMMLL VDVGTNAEIV LGNKERVVAA SSPTGPAFEG
AEISSGQRAA PGAIERVRID PETLEPRFRV IGVDKWSDEE GFAEAAAAVG VTGICGSAII
EVVAEMYLTG IISQDGVVDG AMAAKSPRII PNGRTFSYLL HDGAQRITVT QNDIRAIQLA
KSALYAGIKL LMEKQGVDHV DTIRFAGAFG SFIDPKYAMV LGLIPDCDLT EVKAVGNAAG
TGALMALLNR GHRREIEQTV RKIEKIETAL ESKFQEHFVN AMAMPNKVDA FPKLAEVVTL
PARKVLTDDG GDGSGRRRRR NRE