Gene Smed_6000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6000 
Symbol 
ID5320302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp956458 
End bp957981 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content60% 
IMG OID640777676 
Productintegrase catalytic region 
Protein accessionYP_001314608 
Protein GI150378013 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.234482 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGG TAAGCATGGC GACACGTGCG GAATTGGTTG CGGCGATCAG TTGTCGCTAT 
GTATTAGGTG GGCGGGCCGA GAAGGCGAGG ATGTTAGACG AGTTCGTGGC GCTCACGGGC
TTTCACCGCA AGCATGCGAT GCGACTGCTG CGAGGAGACC GCGAACCGGC GAAGGGTGGT
CCTCGGCCAG GGCGCCGGGT TTACGGCGAT GACGTGCGGG CAGCGCTCGT CGGTGTTTGG
GAGGCGTCGG ATCGAATTTG CGGCAAGCGA CTACACCCCC TGTTGCCAAC ACTGATCGAA
GCGATGGAAC GTCATGGACA CGGCGATATG AATTGCGAGA CGCGCCGGCA ACTCTTGGCG
ATGAGCCCAG CGACGATTGA TCGGGTCCTC AAGGAGATCA AAGCGAGCGC CACGGGTCCG
CGGCGCCGGA AAGGGTCAAC AGCGATACGG CGTAGTGTTC CCGTTCGAAC CTTCTCGGAT
TGGGATGACC CCGCACCCGG CTTTGTCGAG GCTGATCTCG TTTCTCATTC CGGCCCGTAC
GCGAGGGGTG CCTTCTCTCA AACGCTGGTG TTGACCGATA TAGCCACGGG CTGGACGGAA
TGCGCGCCGC TGCTGGTTCG CGAGCAAACG GTACTGATCA CTGCGTTGGC CGAACTGCGC
AAGTTGCTGC CGTTCCCGCT GCTGGGCTTC GACACCGACA ACGACAGTGT ATTCATGAAC
GAGAGTGTTC ATGAGTATTG CTTGCGCGAT AATATCGAGC TCACCCGTTG CCGCCCCTAC
CGAAAGAATG ATCAGGCATT TGTCGAGCAG AAGAATGGCG CGATCGTGCG CAAGATCGTT
GGATACCGAC GCTTCGAGGG GCTGCGAGCC ACTCGGGAAC TGGCCAAGCT TTATTCGTCA
ATGCGGTTGT TCGTGAATTT CTTTCAGCCA TCATTCAAGC TGAAAGAAAA GCACCGTGAC
GGAGCCAAGG TGGTCAAGCG CTATCATCGT CCCGCCACGC CCTATCAGCG GCTGCTTGAC
GACCCACGCA CGCCGGAGGA TACATGCCTT CGGCTCAAGG CGATGTACCT GACACTCGAT
CCGGTCCGGC TGCTCCGCGA CATACGCCTG GCACAAACGA GATTGGTCGA AATTGCTGAC
AAGCCTGATG GTTCGCCTGC CACCGACGGC GAGGCATTAC CGCTCGAAGA CTTTCTGTCT
GGCTTACGGA TTGCTTGGCG TGGTGGTGAA GTGAAACCGA CTGCCCGCTC CAAGCCAGCG
GCCAAGCGAG AGCGGCGGAG GCCCGATCCT CTACTCGCCG TCACTGCCGA ACTCGAGGAA
TGGTTCGAGG CGGAGCCTTG GCGAACTTCG CGAGAGTTGC TTGAACGCTT GCAGGTCAAA
TACCCCGGCG TGTATCCCGA CCGCCTCATT CGGACCGTGC AGCGTCGAAT GAAGATCTGG
CGCAGTACAC AGGCCAATGC GCTGGTGTTC GGGCCATTCG CCGATGTCGC GCGGCAGACG
CAAATCGTAG AGGTCGTGCA GTGA
 
Protein sequence
MRKVSMATRA ELVAAISCRY VLGGRAEKAR MLDEFVALTG FHRKHAMRLL RGDREPAKGG 
PRPGRRVYGD DVRAALVGVW EASDRICGKR LHPLLPTLIE AMERHGHGDM NCETRRQLLA
MSPATIDRVL KEIKASATGP RRRKGSTAIR RSVPVRTFSD WDDPAPGFVE ADLVSHSGPY
ARGAFSQTLV LTDIATGWTE CAPLLVREQT VLITALAELR KLLPFPLLGF DTDNDSVFMN
ESVHEYCLRD NIELTRCRPY RKNDQAFVEQ KNGAIVRKIV GYRRFEGLRA TRELAKLYSS
MRLFVNFFQP SFKLKEKHRD GAKVVKRYHR PATPYQRLLD DPRTPEDTCL RLKAMYLTLD
PVRLLRDIRL AQTRLVEIAD KPDGSPATDG EALPLEDFLS GLRIAWRGGE VKPTARSKPA
AKRERRRPDP LLAVTAELEE WFEAEPWRTS RELLERLQVK YPGVYPDRLI RTVQRRMKIW
RSTQANALVF GPFADVARQT QIVEVVQ