Gene Smed_4880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4880 
Symbol 
ID5318042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1387380 
End bp1388714 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content62% 
IMG OID640776665 
Productprotein of unknown function DUF900 hydrolase family protein 
Protein accessionYP_001313597 
Protein GI150377001 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.729214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCAATC GACGCAGCCG CGATTTTCTT GTCGCCTCAT CAGTTCCTGT GGTTCGCCTT 
CAGCGGATGA TCCTTTGCCT GGCAATTGCC GCTCTTTCCG GCTGCGGCGG GCATCCCAAA
GGCGTGCTGA CGCCTGTCGC CGACAGCGCG CCCACAGCGA GCCGGGTCGA CATGCTGATC
ACCACCACCC GCGGCCGTTC GGAGGTGGCC GGAGAGATGT TTACGGGAGA ACGAGCCCGC
GCACCGGCCT TCGCGAACAT CACCGTCTCG ATCCCGCCTG TCCGCAAGGC CGGAGAGGTT
GCCTGGCCGA AGAAATTGCC GTCCAATCCA GCCACCGATT TCGCGACTTT GAAGGCGGAC
GACCTGACCA GGGATGGAGC CAAGGACTGG CTCAACACCA CAGTCCGGAA AAGCCCCGAC
CGCAGTGTGC TCGTGTTCAT CCACGGCTTC AACAACCGCT TCGAGGACTC CGTCTACCGC
TTCGCTCAGA TCGTCCATGA TTCCGGCGTC CACAGCGCCC CTGTCCTGGT GACATGGCCG
TCGCGGGGCA GCCTGCTTGC CTATGGCTAC GACCGCGAAA GCACCAACTA CACCCGCAAC
GCACTCGAAT CGCTTTTCCA GTATCTGGCC GCGGATAAAG AGGTGAAGGA GGTATCGATC
CTCGCGCATT CCATGGGGAA CTGGCTCACG CTCGAGGCGC TTCGCCAGAT GGCCATCCGC
AATGACGGCC TGCCGGAAAA ATTCAAGAAC GTGATGCTTG CGGCTCCGGA TGTCGATGTC
GACGTCTTCC GTTCGCAAAT CGAGGACATG GGCAGGCAGC ATCCGCGGTT TACCCTGTTT
GTATCCCGCG ACGACCGGGC GCTAGCCTTC TCTCGCAGGG TCTGGGGCGA CATTCCCCGG
CTTGGTTCGA TCGACCCGGA GGCCGATCCC TACAAGCAGG AACTGGCGGA CAACGAGATC
ACCGTTATCG ATCTGACCAA GGTGAAGGCC GGCGACGGCA TGCATCACGG CAAGTTCGCA
GAATCCCCGG AAGTCGTCCG GCTCATCGGT GCACGCATCT CCGAAGGTCA GCCGTTGACC
GACAGCCGGA TGGGCCTCGG GGACCATCTC ATTGCGGGAA CGACGGGAGC AGCCGCTGCG
GCCGGCAGCG CGGCCGGCTT GATCCTTGCC GCTCCGGTCG CCGCCATCGA CCCGCACAGC
AGAGACAATT ATGCGAACCA CGTCGGTGCG GCGATGGGAC AGTCGCATGG CAAGCAGCAG
ATCGCGGTGA AAGACTGTTC GAAACAGGCG CGCGAGCGCG ATGCGGCGTC AACTTCACCG
TGTCGAAGCT GGTGA
 
Protein sequence
MANRRSRDFL VASSVPVVRL QRMILCLAIA ALSGCGGHPK GVLTPVADSA PTASRVDMLI 
TTTRGRSEVA GEMFTGERAR APAFANITVS IPPVRKAGEV AWPKKLPSNP ATDFATLKAD
DLTRDGAKDW LNTTVRKSPD RSVLVFIHGF NNRFEDSVYR FAQIVHDSGV HSAPVLVTWP
SRGSLLAYGY DRESTNYTRN ALESLFQYLA ADKEVKEVSI LAHSMGNWLT LEALRQMAIR
NDGLPEKFKN VMLAAPDVDV DVFRSQIEDM GRQHPRFTLF VSRDDRALAF SRRVWGDIPR
LGSIDPEADP YKQELADNEI TVIDLTKVKA GDGMHHGKFA ESPEVVRLIG ARISEGQPLT
DSRMGLGDHL IAGTTGAAAA AGSAAGLILA APVAAIDPHS RDNYANHVGA AMGQSHGKQQ
IAVKDCSKQA RERDAASTSP CRSW