Gene Smed_4882 details

Gene Information

Locus tag: Smed_4882
Symbol:
ID: 5318044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1391076
End bp: 1392521
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 59%
IMG OID: 640776667
Product: hypothetical protein
Protein accession: YP_001313599
Protein GI: 150377003
COG category: [S] Function unknown
COG ID: [COG5361] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
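
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1391076
END_BP = 1392521
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon is included in the gene length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1446
    print(expected_protein_length(GENE_LENGTH_BP))  # 481
```

Under those assumptions, both computed values agree with the Gene Length and Protein Length fields.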


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.196066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACGA AACGCGATCT GCTTCACGCT GCTGCGATAG CCGCCGCGGT GGCCGCCACG 
GCGGCGAGGT CCACCCCAGC AATTGCCCAG GACAAAGCCG GATGGCCCAG CGTGCTGGAG
GCCAAGGATA TTGCCGAGGA AGGGCTCATC TATGGCTTGC CGCTGGTGAT GAATTACGCG
GTCATGCAGG AGTTCGCAGT CGACAGAAAC TCGGGGCAGT TCAAGGCACC TTTCAACGAA
ATCAACAACA TGCACCATGT CGCGACCCCC GCGGACACTG CAATCATAAC GCCGAACAGC
GATACCCCTT ACTCGTTTGT GTGGCTGGAT TTGCGCGCCG AGCCGATGGT TCTCTCGGTT
CCGGCGATCG ATAAGGACCG ATACTATTCG ATCCAGCTCA TCGACGGCAA CACCTATAAC
TTCGGCTATA TCGGCACGCG CGCCACGGGC ACCGAGCCGG GCGACTATCT GGTGGTCGGC
CCCGACTGGA AGGGTGAAAC GCCCGCCGGT ATCAAGAAGG TCTTCAGATC GACGACGCCG
TTCACGTTTA CCGCTATCCG CACGCAGCTC TTCAACCGCA ACGACATGCC GAGGGTCGAG
AAAATTCAGG CTGGCTACAC CGCGCAGCCT CTCTCCGCTT TCCTGAAACA ACCGGCTCCG
CCCGCATCGC CGAAAATCGA CTTCCTTCCA GCCACCACTG CAGGGATCAA GGACAACTTC
TTCCGATATC TCGATGCGTC CCTGCAATTC GTTCCTGAGA CGTCAAGGGA CAAGGCCATC
CGCGCGAGAC TCGCTAAGAT TGGCATTGGT CCGGGAAAGA CCTTCGAGTT CGAGGATCTG
TCGCTCGAAC ACAAGGACGC AATTCGCGTG GCCATGAAGC AGGGCAATGA CAAAGTCGAC
AAATGGCTGA CCAACGGAAA CAAAAATATC AACGGCTGGA ACATCGGCTC GTTCTTCGGT
GACGAAGCCT TCTTCAACGG TGATTGGATG ATGCGGGCCG GGGCTGCCAA GGGCGGTCTC
TATGGAAATG ATGCCGTTGA AGCCATGTAC CCCTACACCC GAACGGACAC GACCGGCGAG
CCGCTCGACG GCAGCAAGCA CAAGTACACA ATCACCTTCG CACCCGGCCA GTTGCCTCCG
GTAAATGCGT TCTGGTCCGT CACGATGTAC GACGGCAAGA GCCAGTTCCT GGTCAAGAAC
CCGATCGATC GCTACCTCAT CAACTCTCCG ATGTTGCCGG GGATGAAAAG GGCGCCGGAT
GGTTCGCTGA CGCTGTACAT TCAAAAGGAC AGCCCCGGTG CGGACAAGGA GGCAAATTGG
CTTCCAGCCC CGGATGGCAC GATTTATCTC GTGATGCGCC TGTACTGGCC GAAGCCTACG
CCACCCTCGA TTTTGCCGGC GGGCGAGGGG ACATGGCAGC CGCCCGGCGT GAAACGGGTC
TCGTAG
 
Protein sequence
MLTKRDLLHA AAIAAAVAAT AARSTPAIAQ DKAGWPSVLE AKDIAEEGLI YGLPLVMNYA 
VMQEFAVDRN SGQFKAPFNE INNMHHVATP ADTAIITPNS DTPYSFVWLD LRAEPMVLSV
PAIDKDRYYS IQLIDGNTYN FGYIGTRATG TEPGDYLVVG PDWKGETPAG IKKVFRSTTP
FTFTAIRTQL FNRNDMPRVE KIQAGYTAQP LSAFLKQPAP PASPKIDFLP ATTAGIKDNF
FRYLDASLQF VPETSRDKAI RARLAKIGIG PGKTFEFEDL SLEHKDAIRV AMKQGNDKVD
KWLTNGNKNI NGWNIGSFFG DEAFFNGDWM MRAGAAKGGL YGNDAVEAMY PYTRTDTTGE
PLDGSKHKYT ITFAPGQLPP VNAFWSVTMY DGKSQFLVKN PIDRYLINSP MLPGMKRAPD
GSLTLYIQKD SPGADKEANW LPAPDGTIYL VMRLYWPKPT PPSILPAGEG TWQPPGVKRV
S
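
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned up, translated, and checked for GC content. It is not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), and it assumes an uppercase A/C/G/T sequence with no ambiguity codes. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def translate(blocked_dna: str) -> str:
    """Translate a coding sequence, dropping one terminal stop codon if present."""
    dna = clean(blocked_dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(blocked_dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(blocked_dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCTCACGA AACGCGATCT GCTTCACGCT"))  # MLTKRDLLHA
```

The ten residues produced for the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare the result against the Protein Length and GC content fields.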