Gene Smed_5431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5431 
Symbol 
ID5319733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp389181 
End bp390356 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content67% 
IMG OID640777195 
Productphage integrase family protein 
Protein accessionYP_001314127 
Protein GI150377532 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.606096 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGATA ATCATAGAGC GGCCTATTTC GACAGCCCAG CGGTCCACCG ACGCGCCGAA 
GAGCTCGACG CGCTCGACGC CATCCTACCG TTCGACCGAC GCGACCGGCT CGCCGCGCTG
CTGACCGACG ACGATGTCGC GACCCTGAAA CATCTCGCGA GCGAAGGCAT GGGCGAGAAC
ACGCTGCGGG CACTCGCCTC CGATCTCGGC TATCTCGAGG CCTGGTGCCA GCTTGCCACC
GGTTCCCCCC TCCCCTGGCC GGCGCCGGAA GCACTGCTCC TGAAGTTCGT CGCCCATCAC
CTCTGGGATC CGGTCAAGCG CGCCGAGGAC CCGGCCCACG GCATGCCGGC CGAGGTCGAG
GCCGGATTGC GTGCCGAACG CCTGCTGAGG GCCGACGGAC CGCACGCGCC CGGCACGGTG
CGGCGCCGGC TGACCTCCTG GTCGATCCTG ACCCGCTGGC GCGGTCTCAC CGGCGCCTTC
GGTGCGCCAT CGCTGAAGAG CGCGCTGAGG CTGGCAGTCA AGGCGAGCAA CCGGCCGCGC
CAGCGCAAGA GCAAAAAGGC AGTGACCGTC GATATCCTGG CGAAACTGCT TCAGGCTTGT
GCTGGCGATC GGCCGGTTGA CCTCCGCGAT CACGCGCTGC TCCTCACCGC CTTTGCCTCC
GGCGGCCGTC GCCGCTCGGA AGTGGCGGCT CTGCGCGTCG AGGATCTCGC CGACGAGGAA
CCGGTCCGCG CGGATCCCTC CGACAAGACC TCCCCTCCCC TGCCCTGTCT GTCGATCCGC
CTCGGCCGCA CCAAGACGAC GACCGCCGAT GAGAACGAAC ATGTGCTGTT GATCGGCCGT
CCAGTAGCTG CCCTGAAAAC TTGGCTGGCT GAAGCGCTAA TCAAGGACGG CCCGGTGTTC
CGGCGCATCG ATCAGTGGGG CAATATCGAC CTGCGGGCGC TGACGCCGCA GTCTGTCAAT
CTGATCCTGA AAGCACGCTG TGAACAGGCC GGCCTCGATC CGGCGCTGTT TTCGGCGCAC
GGCCTAAGGT CCGGCTATCT GACCGAGGCG GCAAATCGTG GTATCCCGCT GCCCGAGGCG
ATGCAGCAGT CGCTGCACAA ATCGGTGACC CAGGCGGCCA GCTACTACAA CAACGCGGAA
CGAAGGAATG GGCGAGCGGC CCGGCTGATC GTCTGA
 
Protein sequence
MVDNHRAAYF DSPAVHRRAE ELDALDAILP FDRRDRLAAL LTDDDVATLK HLASEGMGEN 
TLRALASDLG YLEAWCQLAT GSPLPWPAPE ALLLKFVAHH LWDPVKRAED PAHGMPAEVE
AGLRAERLLR ADGPHAPGTV RRRLTSWSIL TRWRGLTGAF GAPSLKSALR LAVKASNRPR
QRKSKKAVTV DILAKLLQAC AGDRPVDLRD HALLLTAFAS GGRRRSEVAA LRVEDLADEE
PVRADPSDKT SPPLPCLSIR LGRTKTTTAD ENEHVLLIGR PVAALKTWLA EALIKDGPVF
RRIDQWGNID LRALTPQSVN LILKARCEQA GLDPALFSAH GLRSGYLTEA ANRGIPLPEA
MQQSLHKSVT QAASYYNNAE RRNGRAARLI V