Gene Smed_5907 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5907 
Symbol 
ID5320209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp872130 
End bp873632 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content64% 
IMG OID640777602 
Producthypothetical protein 
Protein accessionYP_001314534 
Protein GI150377939 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.586249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.273775 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCC TCAATGCGCT TCTCGGCGGC TTCGGCTCCG CGCTCTCTCC GATGAACCTG 
CTATGGGCGC TCCTCGGCGT CACGCTCGGC ACGTTCATCG GCGTATTGCC GGGTCTTGGT
CCCGCGCTGA CCATCGCACT TCTGCTGCCC ATAACCTTCC AGGTCGATCC GGCCGCGGCG
TTCATCGTCT TCGGCGGCAT CTATTTCGGC TCGCAGTTCG GCGGCTCGAC GACCTCGATC
CTCATCAACA CGCCCGGTGA AAGCGCCTCG ATCGTGACCG CGCTGGAAGG CAACCGAATG
GCCCGTAACG GCCGCGGCGC GCCAGCCCTT GCGACTGCTG CGATCGGCTC CTTTGTCGCC
GGCACGATCG GCGTCGTCTG TCTGAGCCTC CTCGCGCCCG TGGTCGTCAA GCTGGCGCTC
GCCTTCGCGC CGGCAGACTA TTTCGCGCTG ATGGTTCTGT CCTTCGTCAC CGTCGCCGCC
GTGCTGGGCA ATTCCGTCAT ACGAGGACTT ACCAGTCTCA GCCTCGGCCT CCTGCTCGGC
CTCGTCGGTG TGGATCTGCA ATCAGGCCAG GCCCGCTTCA CATTCGGCGC GCTCGACCTG
CTGGACGGCA TCGACGTGAT CATCGTCGTC GTCGGACTTT TTGCGGTCGG CGAGACGCTG
CATCTCGCCA CCCGCTACCG CTCCTCCCCG GAAGAGATCA TTCCGGTGAA GGGCTCCATG
TGGATGACGG CGCAGGACTG GGCACGCTCC TGGAAAGCCT GGATCCGCGG CGCGCTGATC
GGCTTTCCCA TCGGTGCGAT GCCCGCAGGG GGGGCCGAGA TTCCGACCTT TCTCTCCTAT
TTCGTCGAAA AGAAGCTCTC GAAACATCCG GAAGAATTCG GCCATGGGGC GATTGAGGGC
GTCGCCGGTC CGGAAGCCGC GAACAATGCG GCGGGAGCCG GCGTCTTCGT GCCGCTGCTG
ACGCTCGGCA TTCCGACCTC GGCGACGGCC GCCGTCATGC TGTCGGCCTT CCAGAGCTAT
GGCATCAACC CCGGTCCGCA ACTCCTGACC AGCCACGCCG ATCTCGTATG GACGCTGATC
GCCAGCCTCT ATATCGGCAA CGTGATGCTG CTTATCCTGA ACCTGCCGCT CGTCGGGCTC
TGGGTGCAGA TCCTTCGCAT TCCGACGCCC TATCTTTATG GCGGCATCCT GCTCTTCGCG
ACCGTAGGCA CCTACGGCAT CAGCCGTTCG GTCTTCGACC TCGTCATGCT CTATGCCATC
GGGCTGGCCG GCTTCTTCAT GCGGCGCTAC GATTTTCCGA CCAGCCCCGT GATCATCGGC
ATGATCCTCG GACCCCTCGC CGAGCAGCAG TTCCGCCGGG CCATGACCAT GTCGCAGGGG
GATCTCTCGG TCTTCGTCGC AAGGCCGATT TCAGCAAGCT TGCTTGTACT CGCCTTCATC
GCCCTCACGG CACCCATCGT CCTGTCCTTC CTCCGCAGCC GCCGGGAAAC GGCTGCCGCC
TGA
 
Protein sequence
MDTLNALLGG FGSALSPMNL LWALLGVTLG TFIGVLPGLG PALTIALLLP ITFQVDPAAA 
FIVFGGIYFG SQFGGSTTSI LINTPGESAS IVTALEGNRM ARNGRGAPAL ATAAIGSFVA
GTIGVVCLSL LAPVVVKLAL AFAPADYFAL MVLSFVTVAA VLGNSVIRGL TSLSLGLLLG
LVGVDLQSGQ ARFTFGALDL LDGIDVIIVV VGLFAVGETL HLATRYRSSP EEIIPVKGSM
WMTAQDWARS WKAWIRGALI GFPIGAMPAG GAEIPTFLSY FVEKKLSKHP EEFGHGAIEG
VAGPEAANNA AGAGVFVPLL TLGIPTSATA AVMLSAFQSY GINPGPQLLT SHADLVWTLI
ASLYIGNVML LILNLPLVGL WVQILRIPTP YLYGGILLFA TVGTYGISRS VFDLVMLYAI
GLAGFFMRRY DFPTSPVIIG MILGPLAEQQ FRRAMTMSQG DLSVFVARPI SASLLVLAFI
ALTAPIVLSF LRSRRETAAA